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Phosphonate Metabolizing Plants 

Reference to Prior Applications 

This application claims the benefit of priority to US Provisional Application 
Serial No. 60/108,763 filed November 17, 1998. r , 

Field of the Invention 

The present invention relates in general to herbicide resistance in plants, and more 
particularly to a new class of phosphonate metabolizing genes and methods of using these genes 
for improving plant tolerance to phosphonate herbicides. 

Description of the Prior Art 

Phosphorous containing organic molecules can be naturally occurring or synthetically 
derived. Organic molecules containing phosphorous-carbon (C-P) bonds are also found naturally 
or as synthetic compounds, and are often not rapidly degraded, if at all, by natural enzymatic 
pathways. Synthetic organophosphonates and phosphinates, compounds that contain a direct 
carbon-phosphorous *(C-P) bond in place of the better known carbon-oxygen-phosphorous 
linkage of phosphate esters (Metcalf et al., Gene 129:27-32, 1993). have thus been widely used 
as insecticides, antibiotics, and as herbicides (Chen et aL i! Biol. Chem. 265:4461-4471, 1990; 
Hilderbrand et al., The role of phosphonates in living systems. Hilderbrand. R.L., ed, pp. 5-29, 
CRC Press, Inc., Boca Raton. FL, 1983). Phosphonates are ubiquitous in nature, and are found 
alone and in a diversity of macromolecular structures in a variety of organisms (Jiang et al., J. 
Bacterid. 177:6411-6421, 1995). Degradation of phosphonate molecules proceeds through a 
number of known routes, a C-P lyase pathway, a phosphonatase pathway, and a C-N hydrolysis 
pathway (Wanner, Biodegradation 5:175-184, 1994; Barry et aL US Patent No. 5,463,175, 
1995). Bacterial isolates capable of carrying out these steps have been characterized 
(Shinabarger et al., J. Bacterid. 168:702-707, 1986; Kishore et aL. J. Biol. Chem. 262:12,164- 
12,168, 1987; Pipke et al., Appl. Environ. Microbiol. 54:1293-1296,1987; Jacob et al., Appl. 
Environ. Microbiol. 54:2953-2958, 1988; Lee et al. s J. Bacteriol. 174:2501-2510, 1992; Dumora 
et al., diochim. Biophys. Acta 997:193-198, 1989; Lacoste et al., J. Gen. Microbiol. 138:1283- 
1287, 1992). However, with the exception of phosphonatase and glyphosate oxidase (GOX), 
other enzymes capable of carrying out these reactions have not been characterized. 
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coli B phn operon. The cryptic phn (psiD) operon from E. coli K-12 was subsequently found to 
contain an 8-base pair insertion in phnE. The resulting frameshift in phnE not only results in 
defective phnE gene product, but also apparently causes polar effects on the expression of 
downstream genes within the operon, which prevent phosphonate utilization (Makino et al., J. 
Bacteriol. 173:2665-2672. 1991). The operon has been more accurately described to contain the 
' genes phnC-phn? ' by the work of Makino et al. Further research has been directed to 
understanding the nature of the function of each of the genes within this operon (Chen et al., J. 
Biol. Chem. 265:4461-4471, 1990; Makino et al., J. Bacteriol. 173:2665-2672, 1991; Wanner et 
al., FEMS Microbiol. Lett. 100:133-140, 1992; Metcalf et aL Gene 129:27-32, 1993; Ohtaki et 
al, Actinomyceteol. 8:66-68, 1994). In all of these efforts, the phnO gene has been implicated 
as a regulatory protein based on its similarity to other nucleotide binding proteins containing 
structural helix-turn-helix motifs. Furthermore, mutagenesis of genes in the phn operon 
demonstrated that phnO was not required for phosphonate utilization, further supporting the 
proposed regulatory function for this gene (Metcalf et al., J. Bacteriol. 173:587-600, 1991), at 
least for the phosphonates tested. Homologous phn sequences have been identified from other 
bacteria, including a ,gene substantially similar to E. coli phnO, isolated from S. griseus, using 
nucleotide sequences deduced from those in the E. coli phnO gene (Jiang et al., J. Bacteriol. 
177:6411-6421, (1995); McGrath et al., Eur. J. Biochem. 234:225-230, (1995); Ohtaki et al., 
Actinomyceteol. 8:66-68, (1994)). However, no function other than as a regulatory factor has 
been proposed for phnO. A regulatory role for phnO in the CP lyase operon has been cited 
again in a recent review (Berlyn, Microbiol. Molec. Biol. Rev. 62:814-984, 1998). 

Advances in molecular biology, and in particular in plant sciences in combination with 
recombinant DNA technology, have enabled the construction of recombinant plants which 
contain normative genes of agronomic importance. Furthermore, when incorporated into and 
expressed in a plant, such genes desirably confer some beneficial trait or characteristic to the 
recombinant plant. One such trait is herbicide resistance. A recombinant plant capable of 
growth in the presence of a herbicide has a tremendous advantage over herbicide-susceptible 
species. In addition, herbicide tolerant plants provide a more cost effective means for agronomic 
production by educing the need f - tillage to control weeds and volunteers. 

Chemical herbicides have been used for decades to inhibit plant metabolism, particularly 
for agronomic purposes as a means for controlling weeds or volunteer plants in fields of crop 
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Tanaka et al., 1986). Co-expression of a gene encoding a protein capable of neutralizing or 
metabolizing AMPA produced by glyphosate degradation would provide a substantial 
improvement over the use of GOX alone. Thus, a method for overcoming sensitivity to AMPA 
formation as a result of glyphosate degradation, or a method for resistance to AMPA when used 
as a herbicide or as a selective agent in plant transformation methods, would be useful for 
' providing enhanced or improved herbicide tolerance in transgenic plants and in other organisms 
sensitive to such compounds. 

The use of glyphosate as a chemical gametocide has been described* (U.S. Patent No. 
4,735,649). Therein, it is disclosed that glyphosate can, under optimal conditions, kill about 
95% of male gametes, while leaving about 40-60% of the female gametes capable of 
fertilization. In addition, a stunting effect was typically observed at the application levels 
disclosed, shown by a reduction in the size of the plant and by a minor amount of chlorosis. 
Thus, a major drawback of using glyphosate as a gametocide, as is generally true with most 
gametocides, is the phytotoxic side effects resulting from lack of sufficient selectivity for male 
gametes. These phytotoxic manifestations may be effectuated by AMPA production in 
transgenic plants expressing GOX after treatment with glyphosate. Therefore, it would be 
advantageous to provide a method for preventing the stunting effect and chlorosis as side effects 
of using glyphosate as a gametocide in transgenic plants expressing GOX. Furthermore, a more 
effective method would optimally kill more than 95% of male gametes or prevent male gametes 
from maturing and would leave greater. than 60% of female gametes substantially unaffected. It 
is believed that tissue specific co-expression of GOX with a transacylase gene encoding an 
enzyme capable of N-acylation of AMPA would achieve this goal. 

It has now been discovered that the E. coli phnO gene encodes an enzyme having 
transacylase, acyltransferase, or Acyl-CoA transacylase activity in which a preferred substrate is 
a phosphonate displaying a terminal amine, and in particular amino-methyl-phosphonic acid 
(AMPA). The transfer of an acyl group from an Acyl-CoA to the free terminal amine of AMPA 
results in the formation of an N-acylated AMPA. Plants are not known to acylate AMPA to any 
great extent, and some plants have been shown to be sensitive to AMPA and insensitive to acyl- 
AMPA. Thus, expression of phnO in plants would be useful in enhancing the phosphonate 
herbicide tolerance, particularly when AMPA is used as a herbicide or selective agent in plant 



WO 00/29596 



PCT/US99/27152 



6- 



transformation, and more particularly when glvnhosate i< u L . 

w il h rec „ mbinarap , ants „ e ; G0Xgen f ' S - • " - 

Summary ..the Invention 
5 Briefly therefore the present invention is directed „ „ •• - * 

and ,o raeraods of us,„ g tese g e„es and enc ded ^JJ 

**- - P— ^ is L^T^^. " 

•he compound N-arWraethyi-pbosphonic acj<| J re ,rZ 

-pounds. In addilion , lhe presew jnvemion 

wira ,ran«y,a« genM , 0 preva „ k1 ™ ~ K 

hetero-fertilization in plants. enhancing 
Among the several advantages found to be achieved by the present invent' ». „ 

gene produc,, preferably an N-acyl-rransftrase e „ 2yme The K , ' " nC ° dme 3 d " Md 
is Mm ™« od f polynucleotide sequence preferably 

=:rr™===== 

„~,i „ transcribed RNA. Expression of Die structural DNA sen„e„~ 

produces sufficient levels of the acy Werase enzyme i„ , h . „u n , , ° 
herbicide .olerance of tbe transformed plan,. ' *° "» 

seouencT " T' e,nb0di,ne '"• " m ' C,UIa ' ° NA — * - additional , 

~,ue„ce e„codut g an amino-termina, peptide sequtnce whlcb ^ „ ^ ^ ^ 



WO 00/29596 



PCT/US99/27152 



-7- 

peptide produced from translation of the structural sequence to an intracellular organelle. This 
additional coding sequence is preferably linked in-frame to the structural sequence encoding the 
acyltransferase enzyme. The amino terminal peptide sequence can be either a signal peptide or a 
transit peptide. The intracellular organelle can be a chloroplast, a mitochondrion, a vacuole, 

5 endoplasmic reticulum, or other such structure. The structural DNA sequence may also be 
' linked to 5' sequences such as untranslated leader sequences (UTL's), intron sequences, or 
combinations of these sequences and the like which may serve to enhance expression of the 
desired gene product. Intron sequences may also be introduced within the structural DNA 
sequence encoding the acyltransferase enzyme. Alternatively, chloroplast or plastid 

10 transformation can result in localization of an acyltransferase coding sequence and enzyme to the ^ 
chloroplast or plastid, obviating the requirement for nuclear genome transformation, expression 
from the nuclear genome, and subsequent targeting of the gene product to a subcellular 
organelle. 

Preferably, the recombinant plant expresses a gene encoding an enzyme which catalyzes 
is the formation of AMPA. AMPA formation can result from the metabolism of a naturally 
occurring precursor, from a precursor such as glyphosate provided to the plant, or can result 
from the formation of AMPA through some catabolic pathway. Co-expression of GOX along % 
with AMPA acyltransferase expression provides a plant which is surprisingly more resistant to 
certain phosphonate herbicides. However, one embodiment allowing plants transformed with 
20 only an N-acyltransferase to grow in the presence of AMPA or similar or related compounds 
would provide a useful selective method for identifying genetically transformed plants, callus, or 
embryogenic tissues. 

In accordance with another aspect of the present invention is the provision of a method 
for selectively enhancing or improving herbicide tolerance in a recombinant plant which has 
25 inserted into its nuclear, chloroplast, plastid or mitochondrial genome a cassette comprised of a 
polynucleotide sequence which encodes an N-acyl-transferase enzyme. 

A further embodiment encompasses the improvement of a method for selectively 
enhancing herbicide tolerance in a transformed plant expressing a GOX gene which encodes a 
glyphosate oxidoreductase enzym- expressed in the same plants in which an acyltransferase 
30 enzyme is produced. 
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Ascomycota, Basidiomycota, etc. Plant cells can be derived from any member of the Plantae 
family. 

A further embodiment of the present invention provides for a method for producing a 
plant from a tissue, a cell, or other part of a plant which was derived from a plant transformed 
with an acyltransferase gene, a phnO gene, a gox gene, a gene in which GOX and acyltransferase 
' peptides are produced from a translational fusion or a transcriptional fusion, or a polycistronic 
gene which encodes GOX and acyltransferase peptides. 

A further embodiment of the present invention provides for a method for producing 
plants which express all or a portion of a phnO gene or similar acyltransferase gene, or a GOX 
gene as an antisense gene in a tissue specific manner. 

Other aspects also include reagents such as antibodies directed to AMPA acyltransferase, 
and polynucleotides for use in identifying acyltransferase gene sequences. These reagents can be 
included in kits containing AMPA acyltransferase, polynucleotides which are or are 
complimentary to an AMPA acyltransferase gene sequence, polynucleotides for use in thermal 
amplification of an AMPA acyltransferase gene sequence, antibodies directed to AMPA 
acyltransferase for the detection of AMPA acyltransferase in the laboratory or in the field, and 
any other reagents necessary for use in kit form as well as for use in other assays contemplated 
herein. 

A further object of the present invention is to provide a method for using phosphonate 
herbicides as chemical hybridizing agents. The method allows for selective gametocidal effects 
and for the production of male sterile plants. Such plants may be engineered so that gox or 
phnO, or gox and phnO fail to be expressed in plant tissues required for reproduction, causing 
sensitivity to applied phytotoxic compounds which inhibit formation of mature gamete 
structures. 

Brief Description of the Drawings 

Figure 1 illustrates a [ H C] isotope detection HPLC chromatogram representing a 
sample of a dosing solution containing only [ l4 C] glyphosate (11.3 minutes, 98.8%), and trace 
amour's of [ l4 C] AMPA (5.8 minutes, 0.16%) and an unidentified [ 14 C] material (10.2 minutes, 
1%). 
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oxidoreductase mediated glyphosate metabolism. AMPA acyl transferase (AAT), which herein 
may also be known as AMPA acetyltransferase, AMPA transacylase, or acetyl-AMPA synthase 
(AAS), has been shown herein to be capable of acetyl transferase activity, propionyl transferase 
activity, malonyl transferase activity, and succinyl transferase activity. Thus, any biologically 

5 functional equivalent of these compounds (acetyl, propionyl, malonyl, or succinyl) which serves 
• as an acyl-carrier-form of substrate capable of functioning with an AMPA acyltransferase 
enzyme is within the scope of the present invention. One AMPA acyltranferase which has been 
identified, and shown by example herein to function according to the description contained 
herein, has previously been referred to in the art as PhnO, a protein encoded by the phnO gene 

10 within the £. coliphn operon. 

Biological functional equivalents. As used herein such equivalents with respect to the 
AMPA-acyltransferase proteins of the present invention are peptides, polypeptides and proteins 
that contain a sequence or moiety exhibiting sequence similarity to the novel peptides of the 
present invention, such as PhnO, and which exhibit the same or similar functional properties as 

is that of the polypeptides disclosed herein, including transacylase activity. Biological equivalents 
also include peptides, polypeptides and proteins that react with, Le. specifically bind to 
antibodies raised against PhnO and that exhibit the same or similar transacylase activity, 
including both monoclonal and polyclonal antibodies. 

Biological functional equivalents as used herein with respect to genes encoding 

20 acyltransferases are polynucleotides which react with the polynucleotide sequences contemplated 
and described herein, i.e. which are capable of hybridizing to a polynucleotide sequence which is 
or is complementary to a polynucleotide encoding an acyltransferase which functions in 
transacylation of AMPA or which encode substantially similar acyltransferase proteins 
contemplated and described herein. A protein which is substantially similar to the proteins 

25 described herein is a biological functional equivalent and exhibits the same or similar functional 
properties as that of the polypeptides disclosed herein, including improved herbicide tolerance or 
improved herbicide resistance. Biological equivalent peptides contain a sequence or moiety such 
as one or more active sites which exhibit sequence similarity to the novel peptides of the present 
invention, such as PhnO. Biological equivalents also include peptides, polypeptides, and 

30 proteins that react with, i.e. which specifically bind to antibodies raised against PhnO and PhnO- 
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of polymerase loading, binding, and initiation of transcription functions influences the 
expression of sequences which are linked to the promoter. 

Plant-Expressible Coding Regions: Coding regions which are expressible, i.e can be 
transcribed and/or translated in planta. because they contain typical plant regulatory elements to 
5 facilitate the expression of a gene of interest. 

Plastid Transit Peptide: Any amino acid sequence useful in targeting or localizing a 
linked amino acid, such as a protein fusion, to a subcellular compartment or organelle such as a 
plastid or chloroplast. Amino acid sequences which facilitate entry into a mitochondria are not 
altogether unlike or dissimilar from plastid transit peptides, and are also described as transit 
io peptides, but fail to function for targeting peptide sequences to plastid or chloroplast organelles. 

Progeny of a transgenic plant includes any offspring or descendant of the transgenic 
plant which contains at least one heterologous or trans-gene, or any subsequent plant derived 
from the transgenic plant which has the transgene in its lineage. Progeny is not limited to one 
generation, but rather encompasses the descendants of the transgenic plant so long as they 
is contain or express the desired transgene. Seeds containing transgenic embryos as well as seeds 
from the transgenic, plants and their offspring or descendants are also important parts of the 
invention. Transgenic cells, tissues, seeds or plants which contain a desired transgene are 
progeny of the original transgenic cells, tissue, or plant. < 

Promoter: A recognition site on a DNA sequence or group of DNA sequences that 
20 provides an expression control element for a structural gene and to which RNA polymerase 
specifically binds and initiates RNA synthesis (transcription) of that gene. 

Ro is the primary regenerant plant derived from transformation of plant tissue or cells in 
culture. Subsequent progeny or generations derived from the Ro are referred to as R| (first 
generation), R2 (second generation), etc. 
25 Regeneration: The process of producing a whole plant by growing a plant from a plant 

cell or plant tissue (e.g., plant protoplast or explant). 

Structural Coding Sequence refers to a DNA sequence that encodes a peptide, 
polypeptide, or protein that is produced following transcription of the structural coding sequence 
to messenger RNA (mRNA), followed by translation of the mRNA to produce the desired 
30 peptide, polypeptide, or protein product. 

Structural gene: A gene that is expressed to produce a polypeptide. 
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such that the exogenous polynucleotide confers some genotypic or phenotypic trait or traits to 
that cell and to the progeny of the transformed cell, measured by the detection of the 
exogenously introduced polynucleotide, the mRNA or protein product of the exogenous 
polynucleotide, a metabolite not normally produced by or found within the cell in the absence of 
the exogenous polynucleotide, or a visual inspection of the cell, plant tissue, or plants derived 
from the transformed cell. 

Transgene: A transgene is a polynucleotide sequence which has been transferred to a cell 
and comprises an expression cassette containing a structural gene sequence encoding a desired 
polypeptide. The transgene is capable of being expressed when in a recipient transformed cell, 
tissue, or organism. This may include an entire plasmid or other vector, or may simply include 
the plant functional coding sequence of the transferred polynucleotide. A transgenic cell is any 
cell derived from or regenerated from a transformed cell, including the initially transformed cell. 
Exemplary transgenic cells include plant callus tissue derived from a transformed plant cell and 
particular cells such as leaf, root, stem, meristem, and other somatic tissue cells, or reproductive 
or germ line and tapetal cells obtained from a stably transformed transgenic plant. A transgenic 
event is a plant or .progeny thereof derived from the insertion of at least one exogenous 
polynucleotide into the nuclear, plastidic, or mitochondrial genome of a plant cell or protoplast. 
A transgenic plant is a plant or a progeny thereof which has been genetically modified to contain 
and express heterologous polynucleotide sequences as proteins or as RNA or DNA molecules 
not previously a part of the plant composition. As specifically exemplified herein, a transgenic 
cotton plant, for example, is genetically modified to contain and express at least one 
heterologous DNA sequence operably linked to and under the regulatory control of 
transcriptional and translational control sequences which function in plant cells or tissue or in 
whole plants. A transgenic plant may also be referred to as a transformed plant. A transgenic 
plant also refers to progeny of the initial transgenic plant where those progeny contain and are 
capable of expressing the heterologous coding sequence under the regulatory control of the plant 
expressible transcriptional and translational control sequences described herein. A transgenic 
plant can produce transgenic flowers, seeds, bulbs, roots, tubers, fruit, and pollen and the like 
and can be crrssed by convention breeding means with compatible lines of plants to produce 
hybrid transgenic plants. 
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invention. The presence of plant operable introns has also been shown, in maize in particular, to 
enhance gene expression in certain plant species. Introns may be present in an expression 
cassette in any number of positions along the sequence of the cassette. This can include 
positions between the promoter and the 3' termination sequence and/or within a structural gene. 
There may be more than one intron present in an expression cassette, however for the purposes 
*of the contemplated inventions herein, it is preferred that introns be present when expression 
cassettes are used in monocotyledonous plants and plant tissues. Enhancer sequences are also 
well known in the art and may be present, although not necessarily as a part of an expression 
cassette, as enhancer sequences are known to function when present upstream or downstream or 
even at great distances from a promoter driving expression of a gene of interest. 

The expression of a gene localized to the plant nuclear genome and which exists in 
double-stranded DNA form involves transcription to produce a primary messenger RNA 
transcript (mRNA) from one strand of the DNA by RNA polymerase enzyme, and the 
subsequent processing of the mRNA primary transcript inside the nucleus. This processing 
involves a 3' non-translated polynucleotide sequence which adds polyadenylate nucleotides to 
the 3' end of the RNA. Transcription of DNA into mRNA is regulated by a sequence of DNA 
usually referred to as the "promoter". The promoter comprises a sequence of bases that signals 
RNA polymerase to associate with the DNA and to initiate the transcription of mRNA using the 
template DNA strand to make a corresponding complementary strand of RNA. 

Those skilled in the art will recognize that there are a number of promoters which are 
active in plant cells, and have been described in the literature. Such promoters may be obtained 
from plants, plant viruses, or plant commensal, saprophytic, symbiotic, or pathogenic microbes 
and include, but are not limited to, the nopaline synthase (NOS) and octopine synthase (OCS) 
promoters (which are carried on tumor-inducing plasmids of Agrobacterium tumefaciens), the 
cauliflower mosaic virus (CaMV) 19S and 35S promoters, the light-inducible promoter from the 
small subunit of ribulose 1,5-bisphosphate carboxylase (ssRUBISCO, a very abundant plant 
polypeptide), the rice Actl promoter, the Figwort Mosaic Virus (FMV) 35S promoter, the sugar 
cane bacilliform DNA virus promoter, the ubiquitin promoter, the peanut chlorotic streak virus 
promoter, the ccmalina yellow virus promoter, the chlorophyll a/b binding protein promoter, and 
meristem enhanced promoters Act2, Act8, Actl 1 and EF la and the like. All of these promoters 
have been used to create various types of DNA constructs which have been expressed in plants 
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desired protein to in planta levels ranging from 0.1 milligrams per fresh weight gram of plant 
tissue, to 0.5 milligrams per fresh weight gram of plant tissue, to 1.0 milligrams per fresh weight 
gram of plant tissue, to 2.0 or more milligrams per fresh weight gram of plant tissue. The in 
planta levels of a desired protein in genetically isogenic crops in a field can range across a 
spectrum, but generally the levels fall within 70% of a mean, more preferably within 50°A of a 
mean, and even more preferably within 25% of a mean for all plants analyzed in a given sample. 

The promoters used in the DNA constructs (i.e. chimeric plant genes) of the present 
invention may be modified, if desired, to affect their control characteristics. For example, the 
CaMV35S promoter may be ligated to the portion of the Arabidopsis thaliana ribulose-1,5- 
bisphosphate carboxylase small subunit gene (ssRUBISCO) that represses the expression of 
ssRUBISCO in the absence of light, to create a promoter which is active in leaves but not in 
roots. The resulting chimeric promoter may be used as described herein. For purposes of this 
description, the phrase "CaMV35S" promoter thus includes variations of CaMV35S promoter, 
e.g., promoters derived by means of ligation with operator regions, random or controlled 
mutagenesis, et cetera. Furthermore, the promoters may be altered to contain multiple "enhancer 
sequences' 1 to assist in elevating gene expression. Examples of such enhancer sequences have 
been reported by Kay et al. (1987). 

One RNA produced by a DNA construct of the present invention also contains a 5' non- 
translated leader sequence. This sequence can be derived from the promoter selected to express 
the gene, and can be specifically modified so as to increase translation of the mRNA. The 
nontranslated or 5* untranslated leader sequence (NTR or UTR) can be derived from an unrelated 
promoter or coding sequence. For example, the 5' non-translated regions can also be obtained 
from viral RNA's, from suitable eucaryotic genes, or from a synthetic gene sequence. The 
present invention is not limited to constructs, as presented in one of the following examples, 
wherein the non-translated region is derived from the 5' non-translated sequence that 
accompanies the promoter sequence. Examples of plant gene leader sequences which are useful 
in the present invention are the wheat chlorophyll a/b binding protein (cab) leader and the 
petunia heat shock protein 70 (hsp70) leader (Winter et al., 1988). 

For optical expression in rronocotyledonous plants, an intron should also be included in 
the DNA expression construct. This intron would typically be placed near the 5' end of the 
mRNA in untranslated sequence. This intron could be obtained from, but not limited to, a set of 
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for plant survival or plant viability, while limiting expression of the second cassette comprising a 
DNA molecule in other nonessential tissues. For example, tissues which differentiate into pollen 
development or terminal tissues such as the pollen itself, the tapetal cell layer of the anther, or 
the anther tissues. Alternatively, plant promoters capable of regulating the expression of genes 
in particular cell and tissue types are well known. Those that are most preferred in the 
embodiments of this invention are promoters which express specifically during the development 
of the male reproductive tissue or in pollen at levels sufficient to produce inhibitory RNA 
molecules complementary to the sense RNA transcribed by the constitutive promoter of the first 
expression cassette comprising a DNA molecule. Examples of these types of promoters include 
the TA29 tobacco tapetum-specific promoter (Mariani et al. 1990), the PA1 and PA2 chalcone ^ 
flavonone isomerase promoters from petunia (van Tunen et al. 1990), the SLG gene promoter 
from Brassica oleracea (Heizmann et al. 1991), and LAT gene promoters from tomato (Twell et 
al. 1991). 

Anther and pollen-specific promoters from rice have been isolated. Examples include the 
Osg6B promoter, which was shown to drive expression of the ^-glucuronidase gene in transgenic 
rice in immature anthers. No activity was detected in other tissues of spikelets, leaves or roots 
(Yokoi et al. 1997). The PS1 pollen-specific promoter from rice has been shown to specifically 
express the P-glucuronidase gene in rice pollen (Zou et al., 1994). Additional rice genes have 
been identified that specifically express in the anther tapetum of rice (Tsuchiya et al. 1994, 
Tsuchiya et al. 1997). The isolation of additional genes expressed predominantly during anther 
development in rice can be performed, for example, by construction of a cDNA library to 
identify anther specific clones (Qu et al.). 

Those skilled in the art are aware of the approaches used in the isolation of promoters 
which function in plants, and from genes or members of gene families that are highly expressed 
in particular plant tissues such as in roots, shoots, meristem, leaves, flowers, fruits, in pollen, or 
in plant cell types involved in the production of pollen (Stinson et al. 1987; Brown and Crouch. 
1990; McCormick et al. 1989). Further examples of tissue specific promoters include the 
promoter for the exopolygalacturonase gene of maize (Dubald, et al. 1993) and the promoter for 
the Zr~cl3 mRNA (Hanson, et al. 1989). Promoters which have been shown to preferentially 
express in tomato pollen are the LAT52 and LAT59 promoters (Twell et al. 1991). A portion of 
the maize pZtap promoter sequence (psgB6-l) was disclosed in U.S. Patent 5,470,359. 
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peptides are generally cleaved by specific endoproteases located within the chloroplast organelle, 
thus releasing the targeted mature and preferably active enzyme from the precursor into the 
chloroplast melieu. Examples of sequences encoding peptides which are suitable for directing 
the targeting of the herbicide tolerance gene or transacylase gene product to the chloroplast or 
plastid of the plant cell include the petunia EPSPS CTP, the Arabidopsis EPSPS CTP2 and 
intron, and others- known to those skilled in the art. Such targeting sequences provide for the 
desired expressed protein to be transferred to the cell structure in which it most effectively 
functions, or by transferring the desired expressed protein to areas of the cell in which cellular 
processes necessary for desired phenotypic function are concentrated. Chloroplast targeting 
peptides have been found to be particularly useful in the selection of glyphosate resistant plants 
(Barry et al., US Patent No. 5,463,175; Barry et ah, US Patent No. 5,633,435). Glyphosate 
functions to kill the cell by inhibiting aromatic amino acid biosynthesis which takes place within 
the chloroplast. Therefor, concentrating the resistance gene product within the chloroplast 
provides increased resistance to the herbicide. The examples herein provide for a transacylase 
which is also targeted to or localized to and concentrated within the chloroplast. Specific 
examples of chloroplast targeting peptides are well known in the art and include the Arabidopsis 
thaliana ribulose bisphosphate carboxylase small subunit atslA transit peptide, an Arabidopsis 
thaliana EPSPS transit peptide, and a Zea maize ribulose bisphosphate carboxylase small subunit 
transit peptide. One CTP that has functioned herein to localize heterologous proteins to the 
chloroplast was derived from the Arabidopsis thaliana ribulose bisphosphate carboxylase small 
subunit atslA transit peptide. A polynucleotide sequence encoding a variant of this transit 
peptide used herein provides the native transit peptide amino acid sequence plus a reiteration of 
the transit peptide cleavage site, and has been shown herein to be useful for deploying active 
recombinant transacylase enzyme to the chloroplast (SEQ ID NO:9). 

An alternative means for localizing plant operable herbicide tolerance or herbicide 
resistance genes to a chloroplast or plastid includes chloroplast or plastid transformation. 
Recombinant plants can be produced in which only the mitochondrial or chloroplast DNA has 
been altered to incorporate the molecules envisioned in this application. Promoters which 
function in chioroplasts have Hen known in the art (Hanley-Bowden et al., Trends in 
Biochemical Sciences 12:67-70, 1987). Methods and compositions for obtaining cells 
containing chioroplasts into which heterologous DNA has been inserted have been described, for 
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gene sequence are further aspects of this disclosure. A transgenic bacterium, yeast or fungal cell, 
plant cell or plant derived from such a transformation process or the progeny and seeds from 
such a transgenic plant are also further embodiments of this invention. 

Methods for transforming bacteria and yeast or fungal cells are well known in the art. 
Typically, means of transformation are similar to those well known means used to transform 
other bacteria, such as E. coli, or yeast, such as Saccharomyces cerevisiae. Methods for DNA 
transformation of plant cells include, but are not limited to Agrobacterium-mediaitd plant 
transformation, protoplast transformation, gene transfer into pollen, injection into reproductive 
organs, injection into immature embryos, plastid or chloroplast transformation, and particle 
bombardment. Each of these methods has distinct advantages and disadvantages. Thus, one 
particular method of introducing genes into a particular plant species may not be the most 
effective for another plant species, but it is well known by those skilled in the art which methods 
are useful for a particular plant species. 

There are many methods for introducing transforming DNA segments into cells, but not 
all are suitable for delivering DNA to plant cells. Suitable methods are believed to include 
virtually any method by which DNA can be introduced into a cell, such as by Agrobacterium 
infection, binary bacterial artificial chromosome (BIB AC) vectors (Hamilton et al., 1996), direct 
delivery of DNA such as, for example by PEG-medi^ted transformation of protoplasts 
(Omirulleh et al, 1993), by desiccation/inhibition-mediated DNA uptake, by electroporation, by 
agitation with silicon carbide fibers, by acceleration of DNA coated particles, etc. In certain 
embodiments, acceleration methods are preferred and include, for example, microprojectile 
bombardment and the like. 

Technology for introduction of DNA into cells is well-known to those of skill in the art. 
Four general methods for delivering a gene into cells have been described: (1) chemical 
methods (Graham and van der Eb, 1973; Zatloukal et al, 1992); (2) physical methods such as 
microinjection (Capecchi, 1980), electroporation (Wong and Neuman, 1982; Fromm et al, 
1985; U.S. Patent No. 5,384,253) and the gene gun (Johnston and Tang, 1994; Fynan et al, 
1993; Luthra et al., 1997); (3) viral vectors (Clapp, 1993; Lu et al, 1993; Eglitis and 
Anderson, 1988a; 1988b); and (4) receptor-mediated mechanisms (Curiel et al, 1991; 1992; 
Wagner et al, 1992) 
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plants. To potentiate the expression of these genes, the present invention provides DNA 
constructs comprising polynucleotide sequences encoding these types of proteins which are 
localized to the plant cell cytoplasm as well as sequences encoding plastid targeting peptides 
positioned upstream of the polynucleotide sequences encoding the AMPA transacylase and/or 
GOX proteins. 

In one aspect, nucleotide sequence information provided by the invention allows for the 
preparation of relatively short DNA sequences having the ability to specifically hybridize to gene 
sequences of the selected polynucleotides disclosed herein. In these aspects, nucleic acid probes 
of an appropriate length are prepared based on a consideration of selected polynucleotide 
sequences encoding AMPA transacylase polypeptides, e.g., sequences such as are shown in SEQ 
ID NO:l, SEQ ID NO:2, and SEQ ID NO:3. Such nucleic acid probes may also be prepared 
based on a consideration of selected polynucleotide sequences encoding a plastid targeting 
peptide, such as those shown in SEQ ID NO:9, SEQ ID NO: II, SEQ ID NO: 13, and SEQ ID 
NO: 14. The ability of such nucleic acid probes to specifically hybridize to a gene sequence 
encoding an AMPA transacylase polypeptide or a plastid targeting peptide sequence lends to 
them particular utility in a variety of embodiments. Most importantly, the probes may be used in 
a variety of assays for detecting the presence of complementary sequences in a given sample. 

In certain embodiments, it is advantageous to use oligonucleotide primers. The sequence 
of such primers is designed using a polynucleotide of the present invention for use in detecting, 
amplifying or mutating a defined sequence of a AMPA transacylase gene from any suitable 
organism using PCR™ technology. The process may also be used to detect, amplify or mutate a 
defined sequence of the polynucleotide encoding a plastid targeting peptide. Segments of genes 
related to the polynucleotides encoding the AMPA transacylase polypeptides and plastid 
targeting peptides of the present invention may also be amplified by PCR™ using such primers. 

To provide certain of the advantages in accordance with the present invention, a preferred 
nucleic acid sequence employed for hybridization studies or assays includes sequences that are 
substantially complementary to at least a length of 14 to 30 or so consecutive nucleotides of a 
polynucleotide sequence flanking, in cis with, or encoding an AMPA transacylase, such as that 
shown in SEQ !D NO:5 or SEQ T NO:6, or sequences that are substantially complementary to 
at least a length of 14 to 30 or so consecutive nucleotides of a sequence encoding a plastid 
targeting peptide. By "substantially complimentary", it is meant that a polynucleotide is 
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preferably about 70% complimentary, or more preferably about 80% complimentary, or even 
more preferably about 90% complimentary, or most preferably about 99-100% complimentary in 
sequence to a target polynucleotide sequence. 

A size of at least 14 nucleotides in length helps to ensure that the fragment will be of 
sufficient length to form a duplex molecule that is both stable and selective. "Molecules hfaving 
, complementary sequences over segments greater than 14 bases in length are generally preferred. 
In order to increase stability and selectivity of the hybrid, and thereby improve the quality and 
degree of specific hybrid molecules obtained, one will generally prefer to design nucleic acid 
molecules having gene-complementary sequences of 14 to 20 nucleotides, or even longer where 
desired. Such fragments may be readily prepared by, for example, directly synthesizing the 
fragment by chemical means, such as phosphoramidite chemistries for example; by application 
of nucleic acid reproduction technology, such as the PCR™ technology of U. S. Patents 
4,683,195, and 4,683,202 (each specifically incorporated herein by reference); or by excising 
selected DNA fragments from recombinant plasmids containing appropriate inserts and suitable 
restriction sites. 

The present invention also contemplates an expression vector comprising a 
polynucleotide of the present invention. Thus, in one embodiment an expression vector is an 
isolated and purified DNA molecule comprising a prombter operably linked to a coding region 
that encodes a polypeptide of the present invention, which coding region is operatively linked to 
a transcription-terminating region, whereby the promoter drives the transcription of the coding 
region. The coding region may include a segment or sequence encoding a AMPA transacylase 
and a segment or sequence encoding a plastid targeting peptide. The DNA molecule comprising 
the expression vector may also contain a plant functional intron, and may also contain other plant 
functional elements such as sequences encoding untranslated sequences (UTL's) and sequences 
which act as enhancers of transcription or translation. 

As used herein, the terms "operatively linked" or "operably linked" mean that a sequence 
which functions as a promoter is connected or linked to a coding region in such a way that the 
transcription of that coding region is controlled and regulated by that promoter. Means for 
operatively linking a promoter to a coding region to regulate both upstream and downstream are 
well known in the art. 
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Preferred plant transformation vectors include those derived from a Ti plasmid of 
Agrobacterium tumefaciens. as well as those disclosed, e.g., by Herrera-Estrella (1983), Bevari 
(1983), Klee (1985) and Eur. Pat. Appl. No. EP 0120516 (each specifically incorporated herein 
by reference). In addition, plant preferred transformation vectors directed to chloroplast or 
plastid transformation include those disclosed in U.S. Pat. No. 5,693,507 (1997), U.S. Pat. No. 
5,451,513 (1995),-McBride et al. (1995), Staub et al. (1995a), Staub et al. (1995b), and WO 
95/24492 (each specifically incorporated herein by reference). 

Where an expression vector of the present invention is to be used to transform a plant, a 
promoter is selected that has the ability to drive expression in that particular species of plant. 
Promoters that function in different plant species are also well known in the art. Promoters 
useful in expressing the polypeptide in plants are those which are inducible, viral, synthetic, or 
constitutive as described (Odell et al.. 1985), and/or temporally regulated, spatially regulated, 
and spatio-temporally regulated. Preferred promoters include the enhanced CaMV35S 
promoters, and the FMV35S promoter. 

The expression of a gene which exists in double-stranded DNA form localized to the 
plant nuclear genome involves transcription of messenger RNA (mRNA) from the coding strand 
of the DNA by an RNA polymerase enzyme, and the subsequent processing of the mRNA 
primary transcript inside the nucleus. Genes expressed from within a chloroplast or plastid also 
produce an mRNA transcript which is not processed further prior to translation. In any event, 
transcription of DNA into mRNA is regulated by a region of DNA referred to as the "promoter". 
The DNA comprising the promoter is represented by a sequence of bases that signals RNA 
polymerase to associate with the DNA and to initiate the transcription of mRNA using one of the 
DNA strands as a template to make a corresponding strand of RNA. The particular promoter 
selected should be capable of causing sufficient expression of an AMPA acyltransferase enzyme 
coding sequence to result in the production of an herbicide tolerance effective or herbicide 
resistance effective amount of the transacylase protein localized to the desired intracellular 
location. 

Structural genes can be driven by a variety of promoters in plant tissues. Promoters can 
be near-constitutive (i.e. they drive transcription of the transgene in all tissue), such as the 
CaMV35S promoter, or tissue-specific or developmentally specific promoters affecting dicots or 
monocots. Where the promoter is a near-constitutive promoter such as CaMV35S or FMV35S, 
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Other exemplary tissue-specific promoters are corn sucrose synthetase 1 (Yang et aL, 
1990), corn alcohol dehydrogenase I (Vogel et aL, 1989), corn light harvesting complex 
(Simpson, 1986), corn heat shock protein (Odell et aL; 1985), pea small subunit RuBP 
carboxylase (Poulsen et aL, 1986; Cashmore et aL, 1983), Ti plasmid mannopine synthase 
(McBride and Summerfelt, 1989), Ti plasmid nopaiine synthase (Langridge et aL, 1989), petunia 
chalcone isomerase (Van Tunen et aL, 1988), bean glycine rich protein 1 (Keller et aL, 1989), 
CaMV 35s transcript (Odell et aL, 1985) and Potato patatin (Wenzler et aL, 1989) promoters. 
Preferred promoters are the cauliflower mosaic virus (CaMV 35S) promoter and the S-E9 small 
subunit RuBP carboxylase promoter. 

The promoters used in the DNA constructs of the present invention may be modified, if 
desired, to affect their control characteristics. For example, the CaMV35S promoter may be 
ligated to the portion of the ssRUBISCO gene that represses the expression of ssRUBISCO in 
the absence of light, to create a promoter which is active in leaves but not in roots.. For purposes 
of this description, the phrase M CaMV35S" promoter thus includes variations of CaMV35S 
promoter, e.g., promoters derived by means of ligation with operator regions, random or 
controlled mutagenesis, etc. Furthermore, the promoters may be altered to contain multiple 
"enhancer sequences" to assist in elevating gene expression. Examples of such enhancer 
sequences have been reported by Kay et aL (1987). Chloroplast or plastid specific promoters are 
known in the art (Daniell et aL, US Pat. No. 5,693,507; herein incorporated by reference). 
Promoters obtainable from chloroplast genes, for example, such as the psbA gene from spinach 
or pea, the rbcl and atpB promoter regions from maize, and rRNA promoters. Any chloroplast 
or plastid operable promoter is within the scope of the present invention. 

A transgenic plant of the present invention produced from a plant cell transformed with a 
tissue specific promoter can be crossed with a second transgenic plant developed from a plant 
cell transformed with a different tissue specific promoter to produce a hybrid transgenic plant 
that shows the effects of transformation in more than one specific tissue. 

The RNA produced by a DNA construct of the present invention may also contain a 5' 
non-translated leader sequence (5'UTL). This sequence can be derived from the promoter 
selected to exnress the gene, and nan be specifically modified so as to increase translation of the 
mRNA. The 5' non-translated regions can also be obtained from viral RNAs, from suitable 
eukaryotic genes, or from a synthetic gene sequence. The present invention is not limited to 
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glyphosate inhibits the conversion of phosphoenolpyruvic acid and 3-phosphoshikimic acid to 5- 
enolpyruvyl-3-phosphoshikimic acid by inhibiting the enzyme 5-enolpyruvyl-3-phosphoshikimic 
acid synthase (EPSP synthase or EPSPS). Introduction of a transgene encoding EPSPS allows 
the plant cell to resist the effects of glyphosate, especially when the transgene encodes a 
glyphosate insensitive EPSPS enzyme. Thus, as the herbicide glyphosate functions to kill the 
cell by interrupting aromatic amino acid biosynthesis, particularly in the cell's chloroplast, the 
CTP allows increased resistance to the herbicide by concentrating what glyphosate resistance 
enzyme the cell expresses in the chloroplast, i.e. in the target organelle of the cell. Exemplary 
herbicide resistance enzymes include EPSPS and glyphosate oxido-reductase (GOX) genes (see 
Comai, 1985, U.S. Patent No. 4,535,060, specifically incorporated herein by reference in its 
entirety). 

CTPs can target proteins to chloroplasts and other plastids. For example, the target 
organelle may be the amyloplast. Preferred CTP's of the present invention include those 
targeting both chloroplasts as well as other plastids. Specific examples of preferred CTP's 
include the maize RUBISCO SSU protein CTP, and functionally related peptides such as the 
Arabidopsis thaliana RUBISCO small subunit CTP and the Arabidopsis thaliana EPSPS CTP. 
These CTP's are exemplified by the polynucleotide and amino acid sequences shown in SEQ ID 
NO:9, SEQ ID NO:l 1, SEQ ID NO: 13, and SEQ ID NO: Irrespectively. 

Recombinant plants, cells, seeds, and other plant tissues could also be produced in which 
only the mitochondrial or chloroplast DNA has been altered to incorporate the molecules 
envisioned in this application. Promoters which function in chloroplasts have been known in the 
art (Hanley-Bowden et al., Trends in Biochemical Sciences 12:67-70, 1987). Methods and 
compositions for obtaining cells containing chloroplasts into which heterologous DNA has been 
inserted has been described in U.S. Pat. No. 5,693,507 (1997). McBride et al. (WO 95/24492) 
disclose localization and expression of genes encoding Cryl A 5-endotoxin protein in tobacco 
plant chloroplast genomes. 

An exemplary embodiment of the invention involves the plastid or chloroplast targeting 
or plastid or chloroplast localization of genes encoding enzymes or proteins conferring herbicide 
tolerance or herbicide resistance in plants. Plastid or chloroplast targeting sequences have been 
isolated from numerous nuclear encoded plant genes and have been shown to direct importation 
of cytoplasmically synthesized proteins into plastids or chloroplasts (reviewed in Keegstra and 
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Olsen, 1989). A variety of plastid targeting sequences, well known in the art, including but not 
limited to ADPGPP, EPSP synthase, or ssRUBISCO, may be utilized in practicing this 
invention. In addition, plastidic targeting sequences (peptide and nucleic acid) for 
monocotyledonous crops may consist of a genomic coding fragment containing an intron 
sequence as well as a duplicated proteolytic cleavage site in the encoded plastidic targeting 
. sequences. 

The preferred CTP sequence for dicotyledonous crops is referred to herein as (SEQ ID 
NO:9), and consists of a genomic coding fragment containing the chloroplast targeting peptide 
sequence from the EPSP synthase gene of Arabidopsis thaliana in which the transit peptide 
cleavage site of the pea ssRUBISCO CTP replaces the native EPSP synthase CTP cleavage site 
(Klee etal, 1987). 

For optimized expression in monocotyledonous plants, an intron may also be included in 
the DNA expression construct. Such an intron is typically placed near the 5' end of the mRNA 
in untranslated sequence. This intron could be obtained from, but not limited to, a set of introns 
consisting of the maize heat shock protein (HSP) 70 intron (U. S. Patent No. 5,424,412; 1995), 
the rice Act! intron (McElroy et al* 1990), the Adh intron 1 (Callis et n/. f 1987), or the sucrose 
synthase intron (Vasil et al. 9 1989). 

The 3' non-translated region of the genes of the present invention which are localized to 
the plant nuclear genome also contain a polyadenylation signal which functions in plants to cause 
the addition of adenylate nucleotides to the 3' end of the mRNA. RNA polymerase transcribes a 
nuclear genome coding DNA sequence through a site where polyadenylation occurs. Typically, 
DNA sequences located a few hundred base pairs downstream of the polyadenylation site serve 
to terminate transcription. Those DNA sequences are referred to herein as transcription- 
termination regions. Those regions are required for efficient polyadenylation of transcribed 
messenger RNA (mRNA). Examples of preferred 3' regions are (1) the 3' transcribed, non- 
translated regions containing the polyadenylation signal of Agrobactehum tumor-inducing (Ti) 
plasmid genes, such as the nopaline synthase (NOS) gene and (2) the 3' ends of plant genes such 
as the pea ribulose-l,5-bisphosphate carboxylase small subunit gene, designated herein as E9 
(Fischhoff et ai, 1987). Constructs will typically include the gene of interest along with a 3' end 
DNA sequence that acts as a signal to terminate transcription and, in constructs intended for 
nuclear genome expression, allow for the poly-adenylation of the resultant mRNA. The most 
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preferred 3' elements are contemplated to be those from the nopaline synthase gene of 
A. tumefaciens (nos 3'end) (Bevan et ai, 1983), the terminator for the T7 transcript from the * 
octopine synthase gene of A. tumefaciens, and the 3' end the protease inhibitor I or II genes 
from potato or tomato. Regulatory elements such as TMV Q element (Gallie, et ai, 1989), may 
further be included where desired. 

According to the present invention and as noted above, chloroplast or plastid localized 
genes encoding enzymes conferring herbicide tolerance or herbicide resistance characteristics to 
plants do not require sequences which confer transcription termination and polyadenylation 
signals, but instead may only require transcription termination information at the 3' end of the 
gene. For coding sequences introduced into a chloroplast or plastid, or into a chloroplast or 
plastid genome, mRNA transcription termination is similar to methods well known in the 
bacterial gene expression art. For example, either in a polycistronic or a monocistronic 
sequence, transcription can be terminated by stem and loop structures or by structures similar to 
rho dependent sequences. 

Transcription enhancers or duplications of enhancers could be used to increase 
expression. These enhancers often are found 5' to the start of transcription in a promoter that 
functions in eukaryotic cells, but can often be inserted in the forward or reverse orientation 5' or 
y to the coding sequence. Examples of enhancers incliide elements from the CaMV 35S 
promoter, octopine synthase genes (Ellis et ai, 1987), the rice actin gene, and promoter from 
non-plant eukaryotes (e.g., yeast; Ma et al., 1988). 

In certain embodiments of the invention, the use of internal ribosome binding sites 
(IRES) elements are used to create multigene, or polycistronic, messages. IRES elements are 
able to bypass the ribosome scanning model of 5 methylated Cap dependent translation and 
begin translation at internal sites (Pelletier and Sonenberg, 1988). IRES elements from two 
members of the picornavirus family (polio and encephalomyocarditis) have been described 
(Pelletier and Sonenberg, 1988), as well an IRES from a mammalian message (Macejak and 
Samow, 1991). IRES elements can be linked to heterologous open reading frames. Multiple 
open reading frames can be transcribed together, each separated by an IRES, creating 
polycistronic messages. By virtu^ of the IRES element, each open reading frame is accessible to 
ribosomes for efficient translation. Multiple genes can be efficiently expressed using a single 
promoter/enhancer to transcribe a single message. 
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A variety of methods have been developed to operatively link DNA to vectors via 
complementary cohesive termini or blunt ends. For instance, complementary homopolymer 
tracts can be added to the DNA segment to be inserted and to the vector DNA. The vector and 
DNA segment are then joined by hydrogen bonding between the complementary homopolymeric 
tails to form recombinant DNA molecules. 

A coding 'region that encodes a polypeptide having the ability to confer enhanced 
herbicide resistance enzymatic activity to a cell is preferably a polynucleotide encoding an 
AMPA transacylase or a functional equivalent, alone or in combination, with a gene encoding a 
GOX enzyme or a functional equivalent of GOX. In accordance with such embodiments, a 
coding region comprising the DNA sequence of SEQIDNO:3, SEQ ID NO:7, or SEQ ID 
NO: 19 is also preferred. 

Specific genes encoding AMPA transacylase that have been shown to successfully 
transform plants in conjunction with plastid targeting peptide-encoding genes, to express the 
AMPA transacylase at sufficient herbicidally protective levels are those genes comprised within 
the plasmid vectors. Preferred plasmids containing plastid targeting sequences include 
pMON17261, pMON10151, P MON10149, pMON32570, pMON32571, pMON32572, 
pMON32573, pMON32926, pMON32931, pMON32932, pMON32936, pMON32938, 
pMON32946, pMON32947, pMON32948, and pMON32950. These plasmids contain 
polynucleotide sequences which encode targeting sequences as shown in SEQ ID NO:9, SEQ ID 
NO:ll, SEQ ID NO:13, SEQ ID NO:14. Expression cassettes comprising plant operable 
promoters linked to coding sequences, some with and some without 5' untranslated sequences 
and/or intron sequences, wherein the coding sequences contain at least an AMPA transacylase or 
transacetylase, linked to plant operable termination sequences are disclosed in particular as set 
forth in SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31. 

The work described herein has identified methods of potentiating inplanta expression of 
an AMPA transacylase, which confer protection from glyphosate and related herbicides to plants 
when incorporated into the nuclear, plastid, or chloroplast genome of susceptible plants which 
also express a GOX or similar gene. U. S. Patent 5,500,365 (specifically incorporated herein by 
refer:nce) describes a method for synthesizing plant genes to optimize the expression level of the 
protein for which the synthesized gene encodes. This method relates to the modification of the 
structural gene sequences of the exogenous transgene, to make them more "plant-like" and 
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elements such as enhancer sequences. Promoters which direct specific or enhanced expression in 

certain plant tissues will be known to those of skill in the art in light of the present disclosure. 

It also is contemplated that tissue specific expression may be functionally accomplished 

by introducing a constitutively expressed gene (all tissues) in combination with an antisense gene 

t 

that is expressed only in those tissues where the gene product is not desired. For example, a gene 
' coding for the AMPA transacylase from E. coli may be introduced such that it is expressed in all 
tissues using the 35S promoter from Cauliflower Mosaic Virus. Alternatively, a rice actin 
promoter or a histone promoter from a dicot or monocot species also could be used for 
constitutive expression of a gene. Furthermore, it is contemplated that promoters combining 
elements from more than one promoter may be useful. For example, U. S. Patent 5,491,288 
discloses combining a Cauliflower Mosaic Virus promoter with a histone promoter. Therefore, 
expression of an antisense transcript of the AMPA transacylase gene in a maize kernel, using for 
example a zein promoter, would prevent accumulation of the transacylase in seed. Thus, in a 
plant expressing both GOX and the transacylase, application of glyphosate herbicide would 
result in seed tissues which fail to mature. Conversely, antisense suppression of the GOX gene 
would effectuate the same result. Preferably, suppression of the transacylase in specific tissues 
would be more advantageous, particularly where specific tissues have demonstrated an 
intolerance to AMPA or related compounds. It is specifically contemplated by the inventor that 
a similar strategy could be used with the instant invention to direct expression of a screenable or 
selectable marker in seed tissue. 

Alternatively, one may wish to obtain novel tissue-specific promoter sequences for use in 
accordance with the present invention. To achieve this, one may first isolate cDNA clones from 
the tissue concerned and identify those clones which are expressed specifically in that tissue, for 
example, using Northern blotting. Ideally, one would like to identify a gene that is not present in 
a high copy number, but which gene product is relatively abundant in specific tissues. The 
promoter and control elements of corresponding genomic clones may this be localized using the 
techniques of molecular biology known to those of skill in the art. 

It is contemplated that expression of some genes in transgenic plants will be desired only 
under specified conditions. For example, it is proposed that expression of certain genes that 
confer resistance to environmentally stress factors such as drought will be desired only under 
actual stress conditions. It further is contemplated that expression of such genes throughout a 
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plants development may have detrimental effects. It is known that a iarge number of genes exist 
that respond to the environment. For example, expression of some genes such as rbcS, encoding 
the small subunit of ribulose bisphosphate carboxylase, is regulated by light as mediated through 
phytochrome. Other genes are induced by secondary stimuli. For example, synthesis of abscisic 
acid (ABA) is induced by certain environmental factors, including but not limited to water #ress. 
A number of genes have been shown to be induced by ABA (Skriver and Mundy, 1990). It also 
is expected that expression of genes conferring resistance to applications of herbicides would be 
desired only under conditions in which herbicide is actually present. Therefore, for some desired 
traits, inducible expression of genes in transgenic plants will be desired. 

It is proposed that, in some embodiments of the present invention, expression of a gene in 
a transgenic plant will be desired only in a certain time period during the development of the 
plant. Developmental timing frequently is correlated with tissue specific gene expression. For 
example expression of zein storage proteins is initiated in the endosperm about 15 days after 
pollination. 

It also is contemplated that it may be useful to specifically target DNA insertion within a 
cell. For example, it may be useful to target introduced DNA to the nucleus, and in particular 
into a precise positiort within one of the plant chromosomes in order to achieve site specific 
integration. For example, it would be useful to have a gene introduced through transformation 
which acts to replace an existing gene in the cell, or to complement a gene which is not 
functional or present at all. 

A plant transformed with an expression vector of the present invention is also 
contemplated. A transgenic plant derived from such a transformed or transgenic cell is also 
contemplated. Those skilled in the art will recognize that a chimeric plant gene containing a 
structural coding sequence of the present invention can be inserted into the genome of a plant by 
methods well known in the art. Such methods for DNA transformation of plant cells include 
Agrobacterium-mediated plant transformation, the use of liposomes, transformation using 
viruses or pollen, electroporation, protoplast transformation, gene transfer into pollen, injection 
into reproductive organs, injection into immature embryos and particle bombardment. Each of 
these methods has distinct advantages and disadvantages. Thus, one particular method of 
introducing genes into a particular plant so*ain may not necessarily be the most effective for 
another plant strain, but it is well known which methods are useful for a particular plant strain. 
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There are many methods for introducing transforming DNA segments into cells, but not 
all are suitable for delivering DNA to plant cells. Suitable methods are believed to include 
virtually any method by which DNA can be introduced into a cell, such as infection by A. 
tumefaciens and related Agrobacterium strains, direct delivery of DNA such as, for example, by 
PEG-mediated transformation of protoplasts (Omirulleh et al, 1993), by desiccation/inhib'ition- 
' mediated DNA uptake, by electroporation, by agitation with silicon carbide fibers, by 
acceleration of DNA coated particles, etc. In certain embodiments, acceleration methods are 
preferred and include, for example, microprojectile bombardment and the like. 

Technology for introduction of DNA into cells is well-known to those of skill in the art. 
Four general methods for delivering a gene into cells have been described: (1) chemical methods 
(Graham and van der Eb, 1973); (2) physical methods such as microinjection (Capecchi, 1980), 
electroporation (Wong and Neumann, 1982; Fromm et ai, 1985) and the gene gun (Johnston and 
Tang, 1994; Fynan etai, 1993); (3) viral vectors (Clapp, 1993; Lu etai, 1993; Eglitis and 
Anderson, 1988a; 1988b); and (4) receptor-mediated mechanisms (Curiel etai, 1991; 1992; 
Wagner et ai, 1992). 

The application of brief, high-voltage electric pulses to a variety of animal and plant cells 
leads to the formation of nanometer-sized pores in the plasma membrane. DNA is taken directly 
into the cell cytoplasm either through these pores or as a consequence of the redistribution of 
membrane components that accompanies closure of the pores. Electroporation can be extremely 
efficient and can be used both for transient expression of cloned genes and for establishment of 
cell lines that carry integrated copies of the gene of interest. Electroporation, in contrast to 
calcium phosphate-mediated transfection and protoplast fusion, frequently gives rise to cell lines 
that carry one, or at most a few, integrated copies of the foreign DNA. 

The introduction of DNA by means of electroporation is well-known to those of skill in 
the art. To effect transformation by electroporation, one may employ either friable tissues such as 
a suspension culture of cells, or embryogenic callus, or alternatively, one may transform 
immature embryos or other organized tissues directly. One would partially degrade the cell 
walls of the chosen cells by exposing them to pectin-degrading enzymes (pectolyases) or 
mechanically wounding in a controlled manner, rendering the cells more susceptible to 
transformation. Such cells would then be recipient to DNA transfer by electroporation, which 



PCT/US99/27I52 



■42 



^ ** aaM °- a < «* "WC and ^fo^ cd]s . . p 

screening protocol dependem 0 „ fc *«* to — W bv . suitable selecIion 

Afimh. j me °'" en a»lymcotporatedDNA 
A further advantageous method for deliverino tra , • 

" >ta*M bombardnrem. ,„ u, is met jr ? * 8meMS " > ta < «* 

■ ^ered i„,o « Us by . T " " — * «■* -ids and 

- «« -a cnopiaa. on the surface . ^, 7 ' ^ " "» «" - 

• "* «< .98, Kawata . „ £ ; f ;7" - bribed (Kiein „ „,„ , 9S7; 

«* - *us a„ow u,e of ce 7l!T iCleS ~ ,hr0U8 " ° f 

- bon.bardn.en, me d,od is preferred for * " ^ " KUe The microp^e 

— —ta^ "* *■*"'» ° f or plasdd dlTOted 

An adva„ age of microprojeoile bombardnren. in „MV 
Of repnsduciWv stablv transfer™, '"; dd '»™ '0 „ being an effecive means 

- < «») nor me s „ S ce P ,ioi„" 10 ^ot-o^j . ne.dier die isolation of protoplasts (Cristou 

- of a m«hod fo deiiven 1 C T " " * ""-*» 

Oeitve. Svsrem, * h Tb ^ ~ * * is a Bioiisdes 

*-* a screen, ^ . . ^ * ^ P— coated win, DNA oc cella 

PUn, cuhured cells in ^" - a Hirer surfece covered te 

deiivered „ to r ec ip ,„, M|is „ Iarge , * so rba, thev are no, 

» *«— * projectiie apparatus and I oTTTb !" ^ *" ' — ~* 
«»-• and mav comr^ , 0 . " * *. of proves 

- r— ^ ---zrd;t^::r: MUsmaybs ^°"- 

«°PPing plate. ,f desired one „ '*"' r0Pni " e *— *• 

«— *« device and the ce „ s ,o be bonded ^TT " ^ to 

■ « »ay obain up ,„ , 000 or more J^. ^ *" » ° f « be re . 



WO 00/29596 



PCI7US99/27152 



-43- 

In bombardment transformation, one may optimize the pre -bombardment culturing 
conditions and the bombardment parameters to yield .the maximum numbers of stable 
transformants. Both the physical and biological parameters for bombardment are important in 
this technology. Physical factors are those that involve manipulating the DftA/microprc§ectile 
; t precipitate or those that affect the flight and velocity of either the macro- or microprojectiles. 
Biological factors include all steps involved in manipulation of cells before and immediately 
after bombardment, the osmotic adjustment of target cells to help alleviate the trauma associated 
with bombardment, and also the nature of the transforming DNA. such as linearized DNA or 
intact supercoiled plasmids. It is believed that pre-bombardment manipulations are especially 
important for successful transformation of immature plant embryos. 

Accordingly, it is contemplated that one may desire to adjust various of the bombardment 
parameters in small scale studies to fully optimize the conditions. One may particularly wish to 
adjust physical parameters such as gap distance, flight distance, tissue distance, and helium 
pressure. One may also minimize the trauma reduction factors (TRFs) by modifying conditions 
which influence the physiological state of the recipient cells and which may therefore influence 
transformation and integration efficiencies. For example, the osmotic state, tissue hydration and 
the subculture stage or cell cycle of the recipient * cells may be adjusted for optimum 
transformation. The execution of other routine adjustments will be known to those of skill in the 
art in light of the present disclosure. 

The methods of panicle-mediated transformation is well-known to those of skill in the 
art. U. S. Patent 5,015,580 (specifically incorporated herein by reference) describes the 
transformation of soybeans using such a technique. 

Agrobacterium-mediated transfer is a widely applicable system for introducing genes into 
plant cells because the DNA can be introduced into whole plant tissues, thereby bypassing the 
need for regeneration of an intact plant from a protoplast. The use of Agrobacterium-medioxtd 
plant integrating vectors to introduce DNA into plant cells is well known in the art. See, for 
example, the methods described (Fraley et al, 1985; Rogers et aL, 1987). The genetic 
engineering of cotton plants using Agrobacterium-mzdidted transfer is described in U. S. Patent 
5,004,863 (speciiically incorporated nerein by reference); like transformation of lettuce plants is 
described in U. S. Patent 5,349,124 (specifically incorporated herein by reference); and the 
Agrobacterium-medizted transformation of soybean is described in U. S. Patent 5,416.011 
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An independent segregant may be preferred when the plant is commercialized as a 
hybrid, such as corn. In this case, an independent segregant containing the gene is crossed with 
another plant, to form a hybrid plant that is heterozygous for the gene of interest. 

An alternate preference is for a transgenic plant that is homozygous for the addad 
structural gene; i.e. a transgenic plant that contains two added genes, one gene at the same locus 
on each chromosome of a chromosome pair. A homozygous transgenic plant can be obtained by 
sexually mating (selfing) an independent segregant transgenic plant that contains a single added 
gene, germinating some of the seed produced and analyzing the resulting plants produced for 
gene of interest activity and mendelian inheritance indicating homozygosity relative to a control 
(native, non-transgenic) or an independent segregant transgenic plant. 

Two different transgenic plants can be mated to produce offspring that contain two 
independently segregating added, exogenous genes. Selfing of appropriate progeny can produce 
plants that are homozygous for both added, exogenous genes that encode a polypeptide of 
interest. Back-crossing to a parental plant and out-crossing with a non-transgenic plant are also 
contemplated. 

Transformation of- plant protoplasts can be achieved using methods based on calcium 
phosphate precipitation, polyethylene glycol treatment, electroporation, and combinations of 
these treatments (see e.g., Potrykus et al, 1985; Lorz et al, 1985;'Fromm et al, 1985; Uchimiya 
etal, 1986; Callis <?/ a/., 1987; Marcotte etal, 1988). 

Application of these systems to different plant germplasm depends upon the ability to 
regenerate that particular plant variety from protoplasts. Illustrative methods for the regeneration 
of cereals from protoplasts are described (see, e.g., Fujimura et al, 1985: Toriyama et al, 1986; 
Yamada et al., 1986; Abdullah et al, 1986). 

To transform plant germplasm that cannot be successfully regenerated from protoplasts, 
other ways to introduce DNA into intact cells or tissues can be utilized. For example, 
regeneration of cereals from immature embryos or explants can be effected as described (Vasil, 
1988). 

Unmodified bacterial genes are often poorly expressed in transgenic plant cells. Plant 
codon usage more closely resembles that of humans and other higher organisms than unicellular 
organisms, such as bacteria. Several reports have disclosed methods for improving expression of 
recombinant genes in plants ( Murray et al, 1989; Diehn et al. 1996; Iannacone et al, 1997; 
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Rouwendal et al.. 1997; Futterer el al., 1997; and Futterer and Hohn, 1996). These reports 
disclose various\nethods for engineering coding sequences to represent sequences which are 
more efficiently translated bascH m plant codon frequency tables, improvements in codon third 
base position bias, using recombinant sequences which avoid suspect polyadenylation or A/T 
rich domains or intron splicing consensus sequences. 

U. S. Patent 5:500,365 (specifically incorporated herein by reference) describes the 
preferred method for synthesizing plant genes to optimize the expression level of the protein for 
which the synthesized gene encodes. This method relates to the modification of the structural 
gene sequences of the exogenous transgene, to make them more "plant-like" and therefore more 
likely to be translated and expressed by the plant, monocot or dicot. However, the method as 
disclosed in U. S. Patent 5,689,052 provides for enhanced expression of transgenes, preferably in 
monocotyledonous plants, which is herein incorporated in its entirety by reference. Briefly, 
according to Brown et al., the frequency of rare and semi-rare monocotyledonous codons in a 
polynucleotide sequence encoding a desired protein are reduced and replaced with more 
preferred monocotyledonous codons. Enhanced accumulation of a desired polypeptide encoded 
by a modified polynucleotide sequence in a monocotyledonous plant is the result of increasing 
the frequency of preferred codons by analyzing the coding sequence in successive six nucleotide 
fragments and altering the sequence based on the frequency of appearance of the six-mers as to 
the frequency of appearance of the rarest 284, 484, and 664 six-mers in monocotyledenous 
plants. Furthermore, Brown et al. disclose the enhanced expression of a recombinant gene by 
applying the method for reducing the frequency of rare codons with methods for reducing the 
occurrence of polyadenylation signals and intron splice sites in the nucleotide sequence, 
removing self-complementary sequences in the nucleotide sequence and replacing such 
sequences with nonself-complementary nucleotides while maintaining a structural gene encoding 
the polypeptide, and reducing the frequency of occurrence of 5'-CG-3' di-nucleotide pairs in the 
nucleotide sequence. These steps are performed sequentially and have a cumulative effect 
resulting in a nucleotide sequence containing a preferential utilization of the more-preferred 
monocotyledonous codons for monocotyledonous plants for a majority of the amino acids 
present in the desired polypeptide. 

Thus, the amount of a gene coding for a polypeptide of interest can be increased in plants 
by transforming those plants using transformation methods such as those disclosed herein. In 
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particular, chloroplast or piastid transformation can result in desired coding sequences being 
present in up to about 10.000 copies per cell in tissues containing these subcellular organelle 
structures (McBride et al., Bio/Technology 13:362-365, 1995). 

DNA can also be introduced into plants by direct DNA transfer into pollen as described 
(Zhou et al y 1983; Hess, 1987). Expression of polypeptide coding genes can be obtained by 
injection of the DNA Into reproductive organs of a plant as described (Pena et al 9 1987). DNA 
can also be injected directly into the cells of immature embryos and introduced into cells by 
rehydration of desiccated embryos as described (Neuhaus et al, 1987; Benbrook et al, 1986). 

After effecting delivery of exogenous DNA to recipient cells, the next step to obtain a 
transgenic plant generally concern identifying the transformed cells for further culturing and 
plant regeneration. As mentioned herein, in order to improve the ability to identify 
transformants, one may desire to employ a selectable or screenable marker gene as, or in addition 
to, the expressible gene of interest. In this case, one would then generally assay the potentially 
transformed cell population by exposing the cells to a selective agent or agents, or one would 
screen the cells for the desired marker gene trait. 

An exemplary embodiment of methods for identifying transformed cells involves 
exposing the transformed cultures to a selective agent, such as a metabolic inhibitor, an 
antibiotic, herbicide or the like. Cells which have been transformed and have stably integrated a 
marker gene conferring resistance to the selective agent used, will grow and divide in culture. 
Sensitive cells will not be amenable to further culturing. One example of a preferred marker 
gene confers resistance to glyphosate. When this gene is used as a selectable marker, the 
putatively transformed cell culture is treated with glyphosate. Upon treatment, transgenic cells 
will be available for further culturing while sensitive, or non-transformed cells, will not. This 
method is described in detail in U. S. Patent 5,569,834, which is specifically incorporated herein 
by reference. Another example of a preferred selectable marker system is the neomycin 
phosphotransferase (nptll) resistance system by which resistance to the antibiotic kanamycin is 
conferred, as described in U. S. Patent 5,569,834 (specifically incorporated herein by reference). 
Again, after transformation with this system, transformed cells will be available for further 
culturing upon tres ment with kanam; ;in, while non-transformed cells will not. Yet another 
preferred selectable marker system involves the use of a gene construct conferring resistance to 
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panmaomycin. Use of this type of a select marker sysrem is described in U S. Paren. 
5 424 412 (specifically incoiporated herein by reference). 

Anorher preferred selectable marker sysrem involves .he use of rh= genes contemplared 
bv mis invention. In panicular, a ,/mO gene or a subsramially similar gene encoding an AMPA 
.ransacylase can be utilized as a selecrable marker. Plan, cells which have had a reco-nta^, 
DNA molecule inrroduced inro rheir genome can be selected from a popularron of cells whrch 
failed .o incorporate a recombinant molecule by growing the cells in the presence of AMPA. 
One skilled in the art will recognize the particular advantages the, this selectable marker systtm 
has over previous selecrable marker sysrems. The selemble marker used in .he recombtnan, 
DNA integrated into a plan, genome reduces the amount of DNA targeted for integration because 
■ha selectable marker will also be used for improved herbicide .olerance or improved herbtctde 
resistance in plan, generated from transformed plan, cells. This selectable marker also provides 
an addition* marker sysrem no, known before, particularly in a Held in which mere are often 
only a limited number of selectable markers available. 
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syntiresis. Transformation of chloroptos or plastids w„h a suitable bacerial ooo gene, or wrti, a 
gene encoding a speotinomycin resist plastid or chloroplas. functional ribosomal RNA 
provides a means for selection and maimenance of rransplasuanomic eve„.s (Malign, Trends m 

Biotechnology 11:101-106, 1993). 
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those that cause 100% inhibition followed by screening of growing tissue for expression of a 
screenable marker gene such as kanamycin would allow one to recover transformants from cell 
or tissue types that are not amenable to selection alone. It is proposed that combinations of 
selection and screening may enable one to identify transformants in a wider variety of cell and 
tissue types. The availability of the transacylases of the present invention may obviate* the 
necessity for combination selection and screening by providing an additional selection means. 

The development or regeneration of plants from either single plant protoplasts or various 
explants is well known in the art (Weissbach and Weissbach, 1988). This regeneration and 
growth process typically includes the steps of selection of transformed cells, culturing those 
individualized cells through the usual stages of embryonic development through the rooted 
plantlet stage. Transgenic embryos and seeds are similarly regenerated. The resulting transgenic 
rooted shoots are thereafter planted in an appropriate plant growth medium such as soil. 

The development or regeneration of plants containing the foreign, exogenous gene that 
encodes a polypeptide of interest introduced by Agrobacterium from leaf explants can be 
achieved by methods well known in the art such as described (Horsch el a!., 1985). In this 
procedure, transformants are cultured in the presence of a selection agent and in a medium that 
induces the regeneration of shoots in the plant strain being transformed as described (Fraley et 
al, 1983). In particular, U. S. Patent 5,349,124 (specification incorporated herein by reference) 
details the creation of genetically transformed lettuce cells and plants resulting therefrom which 
express hybrid crystal proteins conferring insecticidal activity against Lepidopteran larvae to 
such plants. 

This procedure typically produces shoots within two to four months and those shoots are 
then transferred to an appropriate root-inducing medium containing the selective agent and an 
antibiotic to prevent bacterial growth. Shoots that rooted in the presence of the selective agent to 
form plantlets are then transplanted to soil or other media to allow the production of roots. These 
procedures vary depending upon the particular plant strain employed, such variations being well 
known in the art. 

Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic 
plants, or pollen obtained from the regenerated plants is crossed to seed-grown plants of 
agronomically important, preferably inbred lines. Conversely, pollen from plants of those 
important lines is used to pollinate regenerated plants. A transgenic plant of the present 
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invention containing a desired polypeptide is cultivated using methods well known to one skilled 
in the art. ^ 

In one embodiment, a transgenic plant of this invention thus has an increased amount of a 
coding region encoding an AMPA transacylase polypeptide which may also be expressed along 
with a plastid targeting peptide. A preferred transgenic plant is an independent, segregant a»d 
can transmit that gene and its activity to its progeny. A more preferred transgenic plant is 
homozygous for that gene, and transmits that gene to all of its offspring on sexual mating. Seed 
from a transgenic plant may be grown in the field or greenhouse, and resulting sexually mature 
transgenic plants are self-pollinated to generate true breeding plants. The progeny from these 
plants become true breeding lines that are evaluated for expression of the transacylase transgene 
as well as for improved herbicide tolerance, particularly when the transacylase transgene is co- 
expressed along with a gene encoding a GOX enzyme. 

The genes and acyltransferases according to the subject invention include not only the 
foil length sequences disclosed herein but also fragments of these sequences, or fusion proteins, 
which retain the characteristic improved herbicidal protective activity of the sequences 

specifically exemplified herein. 

It should be apparent to a person of skill in this art that AMPA transacylase genes and 
peptides can be identified and obtained through several means. The specific genes, or portions 
thereof, may be obtained from a culture depository, or constructed synthetically, for example, by 
use of a gene machine. Variations of these genes may be readily constructed using standard 
techniques for making point mutations. Also, fragments of these genes can be made using 
commercially available exonucleases or endonucleases according to standard procedures. For 
example, enzymes such as JU31 or site-directed mutagenesis can be used to systematically cut 
off nucleotides from the ends of these genes. Also, genes which code for active fragments may 
be obtained using a variety of other restriction enzymes. Proteases may be used to dtrectly 
obtain active fragments of such transacylases. 

Equivalent AMPA transacylases and/or genes encoding these transacylases can also be 
isolated from E. coli strains and/or DNA libraries using the teachings provided herein. For 
example antibodies to the transacylases disclosed and claimed herein can be used to identify and 
isolate other transacylases from a mixture of proteins. Specifically, antibodies may be raised to 
the transacylases disclosed herein and used to specifically identify equivalent AMPA 
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transacylases by immunoprecipitation, column immuno-purificatioa enzyme linked 
immunoassay (ELISA), or Western blotting. 

A further method for identifying the peptides and genes of the subject invention is 
through the use of oligonucleotide probes. These probes are nucleotide sequences having a 
detectable label. As is well known.in the art, if the probe molecule and sequences in a target 
' nucleic acid sample hybridize by forming a strong bond between the two molecules, it can be 
reasonably assumed that the probe and target sample contain essentially identical polynucleotide 
sequences. The probe's detectable label provides a means for determining in a known manner 
whether hybridization has occurred. Such a probe analysis provides a rapid method for 
identifying AMPA transacylase genes of the subject invention. 

The nucleotide segments which are used as probes according to the invention can be 
synthesized by use of DNA synthesizers using standard procedures. In the use of the nucleotide 
segments as probes, the particular probe is labeled with any suitable label known to those skilled 
in the art, including radioactive and non-radioactive labels. Typical radioactive labels include 
32 P, l25 1, 35 S, or the like. A probe labeled with a radioactive isotope can be constructed from a 
nucleotide sequence, complementary to the DNA sample by a conventional nick translation 
reaction, using a DNase and DNA polymerase. The probe and sample can then be combined in a 
hybridization buffer solution and held at an appropriate temperature until annealing occurs. 
Thereafter, the membrane is washed free of extraneous materials, leaving the sample and bound 
probe molecules typically detected and quantified by autoradiography and/or liquid scintillation 
counting. 

Non-radioactive labels include, for example, ligands such as biotin or thyroxin, as well as 
enzymes such as hydrolyses or peroxidases, or the various chemiluminescers such as luciferin, or 
fluorescent compounds like fluorescein, rhodamine, Texas Red, and derivatives and the like. 
The probe may also be labeled at both ends with different types of labels for ease of separation, 
as, for example, by using an isotopic label at the end mentioned above and a biotin label at the 
other end, or with different fluorescent emitters which have overlapping absorption and emission 
spectra. 

Duplex formation and staFlity depend on substantial complementary between the two 
strands of a hybrid, and, as noted above, a certain degree of mismatch can be tolerated. 
Therefore, the probes of the subject invention include mutations (both single and multiple), 
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cleavage recognition sequence or for the purpose of altering the coding sequence so that a gene's 
codons and percent G/C represent those more commonly used by a particular genus or species. 
Site-specific mutagenesis allows the production of deletion, insertion, or replacement mutations 
through the use of specific mutagenesis oligonucleotide sequences comprising the DNA 
sequence of the desired mutation. Mutagenesis oligonucleotides typically provide a primer 
sequence of sufficient size and sequence complexity to form a stable duplex on both sides of the 
desired mutation target site. Typically, a primer of about 17 to 25 nucleotides in length is 
preferred, with about 5 to 10 residues overlapping either side of the desired mutation target site. 

in general, the technique of site-specific mutagenesis is well known in the art, as 
exemplified by various publications. As will be appreciated, the technique typically employs a 
phage vector which exists in both a single stranded and double stranded form. Typical vectors 
useful in site-directed mutagenesis include vectors such as the M13 phage. These phage are 
readily commercially available and their use is generally well known to those skilled in the art. 
Double stranded plasmids are also routinely employed in site directed mutagenesis, and often 
contain a filamentous phage origin of replication which, in the presence of a helper phage, allows 
synthesis of single stranded DNA from the plasmid vector. 

In general, site-directed mutagenesis in accordance herewith is performed by first 
obtaining a single-stranded vector or melting apart of two ( strands of a double stranded vector 
which includes within its sequence a mutation target site. A mutagenesis oligonucleotide primer 
bearing the desired mutant sequence is prepared, generally synthetically. The mutagenesis 
primer is then annealed with the single-stranded vector at the mutation target site, and subjected 
to DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment, in order to 
complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is formed wherein 
one strand encodes the original non-mutated sequence and the second strand bears the desired 
mutation. This heteroduplex vector is then used to transform appropriate cells, such as E. coli 
cells, and clones are selected which include recombinant vectors containing the mutation 
represented by the mutagenesis primer sequence. 

The preparation of sequence variants of the selected peptide-encoding DNA segments 
using site-directed mutagenesis is provided as a means of producing potentially useful species 
and is not meant to be limiting as there are other ways in which sequence variants of peptides 
and the DNA sequences encoding them may be obtained. For example, recombinant vectors 
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encoding the desired peptide sequence may be treated with mutagenic agents, such as 
hydroxylamine. to obtain sequence variants. Such procedures may favorably change the 
protein's biochemical and biophysical characteristics or its mode of action. These include, but 
are not limited to: 1) improved AMPA transacylase formation, 2) improved protein stability or 
reduced protease degradation, 3) improved substrate recognition and binding, 4) improved 
enzyme kinetics, and 5) improved N-acyl-AMPA formation due to any or all of the reasons 
stated above. 

Modification and changes may be made in the structure of the peptides of the present 
invention and DNA segments which encode them and still obtain a functional molecule that 
encodes a protein or peptide with desirable characteristics. The biologically functional equivalent 
peptides, polypeptides, and proteins contemplated herein should possess at least from about 40% 
to about 65% sequence similarity, preferably from about 66% to about 75% sequence similarity, 
more preferably from about 76% to about 85% similarity, and most preferably from about 86% 
to about 90% or greater sequence similarity to the sequence of, or corresponding moiety within, 
the AMPA acyltransferase amino acid sequences disclosed herein. 

The following is a discussion based upon changing the amino acids of a protein to create 
an equivalent, or even an improved, second-generation molecule. In particular embodiments of 
the invention, mutated AMPA transacylase proteins are contemplated to be useful for improving 
or enhancing the in planta expression of the protein, and consequently increasing or improving 
the AMPA transacylase activity and/or expression of the recombinant transgene in a plant cell. 
The amino acid changes may be achieved by changing the codons of the DNA sequence, 
according to the codons given in Table 1, in dicotyledonous, and more particularly in 
monocotyledonous plants. 
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Table 1 



Amino Acid 



Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAC 


GAU 






- 


* 


Glutamic acid 


Glu 


E 


. GAA 


GAG 










Phenylalanine 


Phe 


F 


T IT 


T IT IT T 
UUU 




GGU 

V_J vJ w 






Glycine 


Gly 


U 


llOA 




fine; 






Histidine 


HIS 


TJ 

n 














Isoleucine 


ue 




AHA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


cuu 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 




ecu 






Proline 


Pro 


P 


CCA 


CCC 


CCG 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


UCG 


ucu 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


w 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











For example, certain amino acids may be substituted for other amino acids in a protein 
structure without appreciable loss of interactive binding capacity with structures such as, for 
example, antigen-binding regions of antibodies or binding sites on substrate molecules. Since it 
is the interactive capacity and nature of a protein that defines that protein's biological functional 
activity, certain amino acid sequence substitutions can be made in a protein sequence, and, of 
course, its underlying DNA coding sequence, and nevertheless obtain a protein with like 
properties. It is thus contemplated by the inventor that various changes may be made in the 
peptide sequences of the disclosed compositions, or corresponding DNA sequences which 
encode said peptides without appreciable loss of their biological utility or activity. 

In ma-iing such changes, he hydropathic index of amino acids may be considered. The 
importance of the hydropathic amino acid index in conferring interactive biologic function on a 
protein is generally understood in the art (Kyte and Doolittle, 1982, incorporate herein by 
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hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of the 
foregoing characteristics into consideration are well known to those of skill in the art and 
include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine and 
asparagine; and valine, leucine and isoleucine. 

Polynucleotides encoding heterologous proteins are known by those skilled in the art, to 
' often be poorly expressed when incorporated into the nuclear DNA of transgenic plants 
(reviewed by Diehn et aL, 1996). Preferably, a nucleotide sequence encoding a heterologous 
protein of interest is designed essentially as described in U. S. Patent 5,500,365 and 5,689,052 
(each specifically incorporated herein by reference). Examples of nucleotide sequences useful 
for expression include but are not limited to, SEQ ID NO:3, SEQ ID NO:7, SEQ ID NO:l 1, and 
SEQ ID NO:19. 

Substitutes for an amino acid within the fundamental polypeptide sequence can be 
selected from other members of the class to which the naturally occurring amino acid belongs. 
Amino acids can be divided into the following four groups: (1) acidic amino acids; (2) basic 
amino acids; (3) neutral polar amino acids; and (4) neutral non-polar amino acids. 
Representative amino acids within these various groups include, but are not limited to: (1) acidic 
(negatively charged) amino acids such as aspartic acid and glutamic acid; (2) basic (positively 
charged) amino acids such as arginine, histidine, and lysine j (3) neutral polar amino acids such 
as glycine, serine, threonine, cyteine, cystine, tyrosine, asparagine, and glutamine; (4) neutral 
nonpolar (hydrophobic) amino acids such as alanine, leucine, isoleucine, valine, proline, 
phenylalanine, tryptophan, and methionine. 

Conservative amino acid changes within a fundamental polypeptide sequence can be 
made by substituting one amino acid within one of these groups with another amino acid within 
the same group. The encoding nucleotide sequence (gene, plasmid DNA, cDNA, or synthetic 
DNA) will thus have corresponding base substitutions, permitting it to encode biologically 
functional equivalent forms of an AMP A transacylase. 

The following examples describe preferred embodiments of the invention. Other 
embodiments within the scope of the claims herein will be apparent to one skilled in the art of 
endeavor from consideration of the specification or practice of the invention as disclosed herein. 
It is intended that the specification, together with the examples, be considered exemplary only, 
with the scope and spirit of the invention being indicated by the claims which follow the 
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examples. In the examples all percentages are given on a weight basis unless otherwise 
indicated. ^ 

EXAMPLES 

Example 1 

5 This example illustrates the growth inhibitory effects of N-aminomethyl phosphonicf acid 

, (AMPA) on plant callus tissue, and the lack of inhibition of N-acetyl-aminomethyl phosphonic 
acid on plant callus tissue in in vitro culture conditions. 

Certain recombinant plant species which express a bacterial GOX gene, and which were 
also exposed to glyphosate, can exhibit phytotoxic effects manifested through such symptoms as 

io chlorosis, flower abscission, and reduced fertility. The basis for these symptoms had not 
previously been determined. Previous studies had indicated that plants expressing GOX 
metabolized glyphosate to AMPA and glyoxylate (U.S. Patent No. 5,463,175). Glyoxylate is 
readily metabolized by plants, however AMPA persists in plant tissues and may be the cause of 
phytotoxic effects such as chlorosis, stunting, or other undesireable effects. It had previously 

15 been shown that Achromobacter species LBAA was able to enzymatically modify AMPA to N- 
acetyl AMPA (U.S. Patent No. 5,463,175). The Achromobacter data, coupled with the plant 
phytotoxicity data, indicated that N-acyiation of AMPA in planta may provide effective relief 
from chlorosis and other undesireable effects. Thus, tobacco callus tissue was exposed to AMPA 
and to N-acetyl AMPA in order to determine if either of these compounds exhibited cytotoxic 

20 effects similar to those observed in plants expressing GOX and exposed to glyphosate. 

Tobacco callus was generated from leaf pieces of wild type Nicotiana tabacum cv. 
"Samsun" tobacco on MS 104 plates (MS salts 4.3 g/1, sucrose 30 g/1, B5 vitamins 500X 2 ml/1, 
NAA 0.1 mg/1, and Bacto Agar 1.0 mg/1). Callus tissue was applied to plates with or without 
AMPA and with or without N-acetyl AMPA. Plates contained AMPA or N-acetyl AMPA at 

25 concentrations of 0.1 mM or 0.4 mM. Plates were incubated for up to three weeks and 
monitored periodically. 

Callus tissue on control plates containing no AMPA or N-acetyl AMPA grew at normal 
rates, regenerating roots and shoots as expected. Callus tissue in the presence of AMPA was 
severely inhibited. No growth was observed, showing the phytotoxic effect of AMPA at these 

30 concentrations. Callus tissue on plates containing N-acetyl AMPA was not inhibited, and 
formed roots and shoots similar to control callus tissue growth. This result indicated that 
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AMPA, as a byproduct of GOX mediated metabolism of glyphosate, could be responsible for the 1 
observed phototoxicity in plants. This result also indicated the possibility of an improved 
method for selecting plants from genetically transformed callus tissue, as well as a possible 
method for enhancing glyphosate herbicide resistance. 
Example 2 

This example illustrates that degradation of glyphosate by GOX enzyme hydrolysis in the 
bacterium Achromobacter sp. strain LBAA results in the production of AMPA and N-acetyl 
AMPA. 

It has been previously shown that GOX mediated glyphosate degradation produced 
glyoxylate and AMPA (Barry et aL US 5,463,175). Achromobacter sp. strain LBAA was also 
shown to produce AMPA and glyoxylate as a result of glyphosate degradation. The glyphosate 
degradation pathway was characterized in resting cells of glyphosate-grown Achromobacter sp. 
strain LBAA according to the following procedure. Cells from a 100 ml culture of LBAA, 
grown in DF3S medium containing glucose, gluconate and citrate as carbon sources and with 
thiamine and Yeast Extract (0.01%) to supply trace requirements and with glyphosate at 0.2 mM 
as a phosphorous source, were harvested at a cell density of 200 Klett units, washed twice with 
20 ml of DF3S medium and the equivalent of 20 ml of cells were resuspended in 100 1 of the 
same medium containing [ 14 C]glyphosate (2.5 ml of 52 mCi/mmol, Amersham; CFA.745). The 
cell mix was incubated at 30°C with shaking and 20ml samples were withdrawn at various 
intervals. The samples were centrifuged to separate the cells from the broth supernatant. Both 
the supernatant and cell pellets were analyzed by HPLC. 

Samples prepared in this way were analyzed by strong anion exchange (SAX) HPLC 
with radioisotope label detection to determine their levels of [ l4 C]-AMPA and N-acetyl-[ ,4 C]- 
AMPA. Samples were injected using a Waters WISP autoinjector. Chromatographic profiles and 
quantitative data were collected using MACS2, Monsanto's automated chromatography data 
collection system. A Spherisorb S5 SAX, 250 mm X 10mm column, or an Alltech 5 micron, 
250mmX10mm SAX column was used for the analyses. Solvents used were designated as 
solution A and solution B. Solution A contained 0.005M KH 2 P0 4 adjusted to pH 2.0 with 
H3PO4 in 4% methanol. Solution B contained 0.10 M KH 2 P0 4 adjusted to pH 2.0 with H3PO4 in 
4% methanol. Each sample run time consisted of a step gradient program with an eluent flow 
rate of 3 ml per minute and a scintillation fluid (tradename ATOMFLOW, No. NEN-995 
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obtained from Packard Instruments) flow rate of 9 ml per minute. Tne HPLC solvent profile for 
distinguishing [ ,4 C]-AMPA from N-acetyl-[ l4 C]-AMPA in each sample analyzed was 
represented by 100% solvent A at times zero through 5 minutes, then solvent B at 100% at time 
5 minutes through 15 minutes, then 100% solvent A through 20 minutes at which time the 
column is prepared to receive another sample. ' 

Cell pellets were first resuspended in DF3S medium made acidic by addition of 0.65N 
HCl, boiled for 5 minutes, then centrifuged briefly to provide a solution phase for HPLC 
analysis. Supernatants were treated similarly prior to HPLC analysis. An acidified glyphosate 
control was also subjected to HPLC analysis, and the glyphosate retention time (RT) was 
determined to be 10.8 minutes. The amount of radioactivity in the glyphosate peak remaining in 
the supernatant after two hours incubation had decreased to about 33% of the initial levels, 
indicating that the glyphosate was extensively metabolized. About 3% of the glyphosate was 
found to be within the cell. Material co-eluting with the methylamine standard with an RT of 6 
minutes accounted for about 5% of the initial amount of radioactivity in the supernatant and for 
about 1.5% of the initial amount of radioactivity identified in the cell contents. 

The GOX mediated glyphosate degradation pathway was elucidated further in a 
subsequent experiment where the metabolism of [ l4 C]AMPA was compared to that of 
[ ,4 C]glyphosate as indicated above in resting cells harvested at 165 Klett units and resuspended 
at the equivalent of 15 ml cells per 100 ml DF3S medium. The samples were analyzed by HPLC 
and consisted of whole cultures acidified and treated as described above. Cultures exposed to 
[ l4 C]glyphosate for two hours were found to have 25% of the label in the methylamine/N-acetyl- 
methylamine peak with a retention time of 14.7 minutes, 12.5% as AMPA with a retention time 
of 6 minutes, 30% in a peak with a retention time of 13.2 minutes, and 30% as glyphosate with a 
retention time of 10.8 minutes. Analysis of cultures exposed to [ ,4 C]-AMPA for two hours 
indicated that 15% of the label was found as N-acetyl-methylamine / methylamine, 59% as 
AMPA, and 18% in the 13.2 minute peak. The material eluting at 13.2 minutes was identified as 
N-acetyl-AMPA by negative ion electrospray mass spectrometry. The result showed strong ions 
at m/e 152 and m/e 154, as expected for this compound, which has a molecular weight of 153 
Daltons. The m/e 154 ion was due to the isotopic l4 C atom. N-acetyl-methyl-[ 14 C]-AMPA arises 
from N-methyI-[ ,4 C]-AMPA, which is a known impurity in preparations of [ !4 C]-AMPA. 
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These data indicated that the glyphosate degradation pathway in Achromobacter strain 
LBAA proceeds from hydrolysis of glyphosate to AMP A, which is then converted to the 
products rnethylamine presumably through a dephosphorylation step, and N-acetyl-AMPA 
presumably through some previously unknown transacylation step. A small amount of N- 
acetyl-AMPA is then converted to N-acetyl-methylamine. A similar acylation step has 'been 
inferred from the products identified in E. coli when aminomethylphosphonates are utilized as 
sole sources of phosphate (Avila et al., 1987). 
Example 3 

This example illustrates the identification of an AMPA acyltransferase activity in E. coli. 
Avila et al. (1987) identified dephosphorylated biodegradation products from the 
metabolism of a variety of aminophosphonate substrates used as sole phosphate sources in vivo 
in E. coli while studying C-P bond scission. Their studies indicated that AMPA was not a 
substrate for acylation in E. coli K-12. In addition, Avila et al. were interested in the effect of 
N-linked chemical substitutions on C-P bond scission of phosphonates in E. coli, and identified 
N-acetylated products derived from the metabolism of some aminophosphonates. Avila et al. 
also demonstrated that 'wild type' £ coli K12 strains, unlike wild type E. coli B strains, are 
unable to use phosphonates as a source of phosphate. Thus, in consideration of the phytotoxic 
effects of AMPA on callus tissue as shown in Example 1 and the generation of AMPA from 
GOX mediated glyphosate degradation as shown in Example 2, the E. coli data in Avila et al. 
indicated that there may be an enzyme or pathway present in some bacterial species which is 
capable of converting aminomethylphosphonate (AMPA) to N-acetyl-AMPA. An enzyme or 
pathway with those characteristics would, if expressed in plants, confer a significant advantage 
to plants expressing GOX when treated with glyphosate. 

To test this, an E. coli K-12 strain adapted for growth on AMPA was grown on low 
phosphate containing medium in order to obtain cell lysates to be assayed for the presence of an 
enzyme capable of AMPA N-acylation. The phn (mpu) operon is cryptic in E. coli K-12 due to 
an 8 base pair insertion which causes a frameshift mutation in the phnE gene. The frameshift 
inactivates PhnE and creates a polar effect on translation of other genes downstream of phnE 
within the operon, resulting in the inability of such mutants to use phosphonates as phosphate 
sources (Makino et al., J. Bacterid. 173:2665-2672, 1991). Selection of a spontaneously derived 
mutation restores the function of the phn operon (phn+ or mpu+). Thus, K-12 strains adapted 
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— « shakin, a, W " LoTs Id T " ^ 

- Phosphate supped wirh J^SlZZX '\T " " 
20 generally grown to about 220 Ki«t ■, , P °' 11,6 Cultures were 

P«s a, ,,000 psi. Lysalss * ' , " Md lyXd ™ h - P— a French 

cultures grown in this manner. ^ 2 Sh0WS ^ resu,ts of <*» 

Table 2. 



Effects of Phos D h a ,. g .,^„... ir| „ Gromh 




.Growth Period fhro 
Harvest Density (Klett 

Units) 

- indicates no measurable growth 
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An HPLC assay was used to determine the presence or absence of any AMPA 
acyltransferase activity in the media and cell lysates. The assay monitors the conversion of 
[ l4 C]AMPA to N-acetyl-[ 14 C] AMPA. Generally, 100 jil of a 2X assay solution consisting of 
16.5 mg acetyl-CoA, 250 nl of 2M Tris, pH 7.5, 4.5 mis dH 2 0 and [ 14 C]AMPA (30mM) was 
5 mixed with 25-75 }il of lysate and 1 .|il each of 0.5 M MgCI 2 and MnCl 2 , and brought to 200 |il 
with dH20. The assay was incubated for 30 minutes at 37°C, and quenched with 200 jil 90-100 
mM NaOAc (sodium acetate) pH 4.4 in ethanol and then analyzed immediately by HPLC as 
described above, or stored at -20°C. Only GB993 lysate samples derived from cultures grown 
in the presence of AMPA or 0.1 mM phosphate supplemented media demonstrated appreciable 
10 AMPA acyltransferase activity. This result indicated that a gene encoding an acyltransferase 
enzyme capable of AMPA N-acylation was present in GB993 and was regulated for expression 
when grown under low phosphate conditions. Thus, the coding sequence for the enzymatic 
activity appears to be part of the pho regulon and may reside in the phn operon. 
Example 4 

is This example illustrates the identification of an E, coli phn operon gene encoding an 

enzyme capable of AMPA acylation. 

Example 3 indicated that the AMPA acyltransferase activity observed in lysates of E. coli 
may be encoded by a gene in the phn operon. The entire p/w'operon in E. coli B and in E. coli K- 
12 has previously been cloned and sequenced B (Wanner et al., Chen et al.). The E. coli K-12 

20 phn operon DNA sequence has been shown to be identical to the published DNA sequence of the 
phn operon from E. coli B with the exception of an eight base pair insertion in the phnE gene 
(Wanner et al). Clones containing various amounts of the phn operon genes from either 
bacterial genetic background are readily available (Wanner et al., Chen et al., Dr. J.W. Frost at 
Purdue University). Plasmids containing differing amounts of the JM101 phn operon DNA were 

25 used to transform JMlOl(wpw-) in order to test for a plasmid localized phn gene that, when 
expressed, confers upon JM101 the ability to utilize AMPA as a sole phosphate source. 

A plasmid obtained from J. Frost (Dr. J.W. Frost, Department of Chemistry, Purdue 
University, West Lafayette, Indiana 47907), designated herein as pF, contains an E. coli K-12 8 
kb EcoRl frag, lent which encode, the phn operon genes ph*G through phnQ. A single Ncol site 

30 is present at the 5' end of the phnG coding region. Plasmid pF was digested with EcoRl and 
Ncol, releasing a 2 kb Ncol-EcoRl fragment containing the genes phnQ through phril, and a 



WO 00/29596 



PCT/US99/27152 



-64- 



second Ncol-EcoZl fragment about 6 kb in length containing the genes phni through phnQ. 
Each fragment w»s gel punfied and ligated into a cloning and expression vector in an orientals 
which would allow for expression of the phn operon genes present within each of the Ncol- 
EcoBl fragments from a plasmid borne inducible promoter. The 2 kb fragment was inserted into 
the Afcol - EcoM sites within the vector pMON7258, a positive selection cloning vector identical 
to pUC118 with the exception of polylinker domain (Viera et aL, Methods Enzymol. 153:3, 
1987), the resulting plasmid being designated as P 58-l. The orientation of the 2 kb fragment in 
p58-l allows for the expression of the phnG-phnl genes from the lac promoter within the vector. 
The 6 kb EcoRl-Ncol fragment was inserted into the Neol and EcoRl sites in a similar positive 
selection vector, pMON7259, producing the plasmid designated as pMON17195. P MON7259 is 
identical to pUC119 except for the polylinker domain, which contains a multiple cloning site 
opposite in orientation to that within P MON7258, and which also allows for expression of the 
phnJ-phnQ genes from a lac promoter. P 58-l and P MON7259 were transformed into E. coli 
K12 (mpu-) strain JM101, and maintained with ampicillin antibiotic resistance selection. 
pMON7259 and pF were also transformed into JM101 as negative and positive controls, 
respectively. 

Cultures of each transformant were grown overnight in M9 liquid broth media 
supplemented with 2% casamino acids, thiamine, and 0.2% glucose with shaking at 37°C, and 
then diluted 1:50 into 50 ml of fresh pre-warmed media of the same composition in a 250 ml 
side-armed Erlenmeyer flask. Cultures were incubated with shaking at 37°C until reaching a cell 
density of about 80-100 Klett Units as measured on a Klett-Summerson spectrophotometer 
through a #2 green filter. Expression from the plasmid lac promoter was induced by the 
addition of 100 microliters of 500 mM IPTG so that the final IPTG concentration was about 1 
mM The induction phase growth period was allowed to progress for two hours. Table 3 shows 
the cell density profile of each culture from 1 :50 dilution through the two hour induction penod. 
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Tablc3. 



Induction Profile of JM101 Cultures Harboring Various phn Plasmids 



Culture/ Plasmid 


IPTG 


Io 


Ii 


h 


pMON7259 


+ 


13 


75 


222 


p58-l 


+ 


15 


70 


212 


pMON17195 


+ 


15 


90 


220 ■* 


pF 


+ 


17 


97 


290 


pF 




15 




260 



I 0 indicates the cell culture density at the 1:50 dilution time point; Ii indicates the cell culture 
density at the time of IPTG addition; and h indicates the cell culture density at the time of 
harvest. 



The cells in each culture were harvested by centrifugation at 10,000 rpm for 10 minutes 
at 4°C in a Beckman J2 centrifuge. The cell pellet was washed one time in ice cold 154 mM 
NaCl solution, and then resuspended in 1.5 ml extraction buffer (50 mM Tris-HCl pH 7.5, 1 mM 
DTT, 50 mM Tris-HCl pH 7.5). Cell suspensions were ruptured with two passes through a 
French Press at 1000 psi. The resulting lysate was centrifuged for 15 minutes at 14,000 rpm at 
4°C in an EPPENDORF™ model 5402 microcentrifuge in order to remove debris. Each cleared 
lysate was transferred to a fresh pre-chilled tube and the volume of the extract was adjusted to 
2.5 ml with 50 mM Tris-HCl pH 7.5. A PD10 column was equilibrated with 25 ml 50 mM Tris- 
HCl, pH 7.5 and then each sample was applied to the desalting column. Each eluted sample was 
adjusted to 3.5 ml with 50 mM Tris-HCl, pH 7.5. Each sample was distributed to assay tubes 
and mixed with reagents in order to assay for the presence of AMPA acyltransferase activity as 
shown in Table 4. 

Table 4. 

Assay Conditions for Bacterial Lysates Expressing phn Genes 



Sample 


IPT 


Extract 


50 mM Tris 


2X Assay 


Total 


G 


Volume* 


Volume* 


Mix 


Volume* 










Volume* 




pMON7259 


+ 


25 


75 


100 


200 


pMON7259 


+ 


100 


0 


100 


200 


p58-l 


+ 


25 


75 


100 


200 


p58-l 


+ 


100 


0 


100 


200 


pMON17195 


+ 


25 


75 


100 


200 


pMON17195 


+ 


100 


0 


100 


200 


pF 


+ 


25 


75 


100 


200 


pF 


+ 


100 


0 


100 


200 


pF 




25 


75 


100 


200 


pF 




100 


0 


100 


200 




na 


0 


100 


100 


200 



♦all volumes are in microliters 

Composition of mixtures of each sample, designated by plasmid content, as 
prepared for AMPA acyltransferase assay. 
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Each « « incubated « „. c fM 3( , mmuKs md qu£nched m 

of ,0,00 mM NaOAc (sodinn, ac= a ,e>, P H 4, ,„ erhano, md if „ 01 
— a ly by HPLC as desCbad above, .hen stored overn ig „, a, -20'C. Unused ^ of 
each lys a:= were s.orad ehhe, a, 4»C. or nrixed whh gly ce,„, ,o ,0% by vo lme , and ^ „ . 

hplc fT- ° f ^ T 10 ' h£ AMPA *» -y-J by 

HPLC for ft. presence of [ CJAMPA and ac y ,a,ed ("CJAMPA, as deschbed above The 
results are shown in Table 5. 
Table 5. 

HPLC Analysis of Bacterial Lysate 
Conversion of AMPA to Acetyi-AMPA 



Sample 



PMON7259 



PMON7259 



p58-l 



p58-l 



pMON 17195 



PMON17195 



%Acetyl 
AMPA 



no data 



8 



%AMPA 



no data 



13 



100 



pF 



pF 



pF 



pF 



100 



92 



95 



87 



61 



97 



52 



90 



39 



48 



10 



100 



Results of HPLC analysis of each sample, indicatine the 
relanve amount of ["q AMPA or acetyH'^AMPA'as a 
J^geof the total amountof ["q in boVpeaks combined 



This data indicated that the p.asmid containing the 6 kb Ncol-EcoRl fragment isolated 

womot " PMONm95 COmained ^ ~ ~ * h ' <*» «™ ^ducL 0 fl t 
P m ter m an mpu . strain o{ R co/ , ^ fc rf _ _ 

capable of converting all of the ^AMPA available jn ^ ^ ^ tQ * 
Th^ne or genes retired for AMPA N-acylation were mrther defined by restriction deletion 

Plasmids containing various segments of the,/,* operon from either £ co// B or £ co/7 
K-12 were constructed to further delineate the nature of the,*,, operon gene or genes involved 
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in conferring AMPA acyltransferase activity when expressed in an mpu- E. coli JM101. 
pMON7333 contains the pMON17195 equivalent E. coli DNA insertion, but in pUCl 19, and is a 
single E. coli B strain HincHYl fragment containing the wild type phn operon genes pk»G through 
phnQ. pMON15020 was constructed by cloning a 5,713 base pair Nco\ to EcoRl E. coli B DNA 
fragment from pMON7333 into pMON7259, and contains the genes phni through phnQ. 
pMON 15022 was-constructed by inserting a 1,686 base pair EcoRl to Sail fragment from 
pMON17195 into the positive selection cloning and expression vector pBlueScriptSP 
(Invitrogen), which contains the £ coli K-12 genes phnQ, P and Q. pMON 15023 was 
constructed by deleting an L820 base pair Sail fragment from pMON17195, leaving behind the 
£ coli K- 1 2 phn operon genes phni and phnK, the 5'end of phnU and all of phnO, P and Q. 

The plasmids pMON17195, pMON15020, pMON15022, pMON15023, and pMON7259 
were transformed into the mpu- E. coli K-12 strain JM101 and were maintained by ampicillin 
antibiotic selection. Overnight cultures of each of these transformants were grown with 
antibiotic selection and were diluted 1:50 into fresh M9 media as described above, and 
incubated at 37°C with shaking in 250 ml sidearm-Erlenmeyer flasks to a cell density of about 
100 Klett units. Each culture was induced with IPTG as in example 3, and incubated for two 
additional hours with shaking. The cells were harvested by centrifugation in a Beckman J2 
centriftige at 4,000 RPM for 10 minutes at 4°C. Cell pellets were washed once with 50 ml of 
154 mM NaCl, and stored at -20°C 

Cell pellets were resuspended in 1.5 ml Extraction Buffer as in example 3 and ruptured 
by two passes through a French Press at 1000 psi. The ruptured cell suspensions were 
centriftiged in an Eppindorf microcentrifuge Model 5402 for 15 minutes at 14,000 rpm and at 
4°C. The cleared lysates were decanted into new tubes pre-chilled on ice, and the total volume 
was adjusted to 2.5 ml with addition of Extraction Buffer. These samples were desalted over a 
PD10 column pre-equilibrated with 25 ml of 50 mM Tris-HCl, pH 7.5, and eluted with 3.5 ml of 
50 mM Tris HC1 pH 7.5. Samples were then subjected to an AMPA acylation assay as described 
above, incubated for 30 minutes at 37°C, and quenched with 200 microliters of 90.9 mM NaOAc 
pH 4.4. The volumes of each sample used in the assay are noted in Table 6. All volumes 
represent micro Mters of each solution used. 
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Table 6. 




uZXZSzr ach samp,e ' indica,in8 *• re,a,ive — ° f ^ ^ - 

J* I C]AMPA a, , pe rc e« ee of the lotal „_„„,,, of (Mq ^ ^ 
Table 7. 

HPLCA„ lysi50fBac , erla | Lysa ,„ , <c| , AM1>A Convm . on M Am AMpA 
S a in. p I e f y J rTTr~T — %[ 14 cjl — r™""^ — — - 




10 



pMON 7259 , , _ _ 

combined percentage of Jk total amount of ["cj in both peaks 
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The data in Table 7 indicates that AMPA acylation activity is derived from the phn operon open 
reading frames consisting of phnO, phn?, and phnQ, which are the only phn genes present in 
pMON 15022. Other plasmids conferring AMPA acylation activity upon induction also 
contained at least the phnQ, P, and Q genes, providing strong evidence that the observed activity 
was the result of one or more of these gene products. Therefore, additional plasmids were 
constructed based- on the phnO, P, and Q gene sequences in order to determine which gene or 
genes were required for the acylation function. 

Bacterial acylase, transacylase, and acyltransferase genes have been known in the 
literature for some time. Most are small 15-25 K Da proteins. Therefore, on the basis of size 
comparison, only the phnO and phnQ gene products would fall into this category. However, 
based on similarity comparisons with other proteins in the GENBANK, SWISSPROT, and 
EMBL databases, the predicted phnO gene product appeared to most closely resemble other 
proteins having acylase activity. For example, the E. coli PhnO protein aligned well with a 
gentamicin acetyltransferase-3-I described in Wohlleben et al. (Mol. Gen. Genet. 217:202-208, 
1989). pMON 15020 containing the E. coli B phn operon genes p/mJ through phn? on a single 
6.0 kb Ncol-EcoRl fragment was digested with Sa/I and EcoM to release a 2.0 kb fragment 
containing the phnO, P and Q genes. This 2 kb fragment was excised and purified from a 0.7% 
TAE Agarose gel , treated with T4 DNA polymerase to'excise the 3' overhanging ends, then with 
Klenow and deoxynucleotide triphosphates (dXTP's) to provide blunt ends, and then ligated into 
the £coRV site of pBlueScriptSP to produce plasmid pMON15024. pMON15024 was digested 
with Ndel and £coRI, deleting a 1200 base pair fragment containing most of the phn? and all of 
the phnQ coding sequences. The remaining pMON 15024 plasmid fragment still containing the 
phnO gene was treated with Klenow fragment DNA polymerase in the presence of 
dideoxynucleotides according to the manufacturer's instructions in order to fill in the 3' ends 
exposed by restriction enzyme digestion, then ligated together to produce the plasmid 
pMON15027. pMON15027 contains only the phnO gene flanked 3' by a small portion of phn?. 
The 1200 base pair Ndel to £coRI fragment obtained from pMON15024 was cloned into 
pMON2123 to produce pMON15026, which contains the 3' two thirds of the phn? gene flanked 
3" by phnQ. Plasmids pMON 15024, 15026, and 15027 were introduced into mpu- JM 101, and 
cell lysates of transformants were analyzed as above after growth and induction for the presence 
of AMPA acyltransferase activity. Only pMON15024 and pMON15027 exhibited 
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acyltransferase activity, indicating that the phnO gene product was responsible for AMPA 
acylation. ^ 

A DNA fragment containing only the phnO gene with convenient flanking restriction 
endonuclease sites for use in further cloning manipulations was produced using thermal cycling 
methods. Synthetic oligonucleotide primers were synthesized by Midland Certified Reagents, 
t Co. (Midland Texas) based on the published phnO gene and flanking sequence in order to 
amplify the phnO gene (Chen et al, J. Biol. Chem. 256: 4461-4471, 1990). The sequence 
AAACACCATGGCTGCTTGTG (SEQ ID NO: 5), designated AATPCR6, represents a 
synthetic oligonucleotide which is homologous to the template strand of the phnO gene. The 5* 
adenosine residue of SEQ ID NO: 5 corresponds to base pair 13,955 of the published phn 
operon sequence, immediately 5* of the phnO ATG initiation codon at position 13,962-13,964 
(Chen et al., J. Biol. Chem. 256: 4461-4471, 1990). SEQ ID NO: 5 incorporates a single base 
pair mismatch from the published phnO sequence at position 13,965 represented by a C to G 
inversion, which generates an alanine codon in place of a proline codon at position 2 and also 
creates a unique Ncol restriction site spanning the ATG initiation codon. The sequence 
GTGACGAATTCGAGCTCATTACAGCGCCTTGGTGA (SEQ ID NO: 6), designated 
AATPCR7, represents a synthetic oligonucleotide which is homologous to the coding strand of 
the phnO gene. The 3' adenosine residue of SEQ ID NO: 6 corresponds to base pair 14,380 of 
the published phn operon (Chen et al., J. Biol. Chem. 256: 4*461 -4471, 1990). The thymidine at 
position number nineteen of SEQ ID NO: 6 corresponds to the adenosine at position 14,396 of 
the published phnO sequence (Chen et al.). A portion of SEQ ID NO: 6 overlaps the native 
phnO termination codon, introduces a second in frame termination codon immediately 3' of and 
adjacent to the native termination codon, and also introduces unique EcoRl and Sad restriction 
sites 3 r of these termination codons. 

pMON 15024 was used as a template for amplification of the phnO gene in a standard 
thermal amplification reaction. Briefly, a 100 microliter reaction sample was prepared which 
contained 0.1 ng template DNA, reaction buffer, 200 pM each primer, 200 mM dNTP, 1.25 U 
Taq DNA polymerase and was overlayed with mineral oil. This reaction sample was subjected 
to thirty five cycles at 94°C for one minute, 50°C for two minutes, and 72°C for three minutes 
which resulted in the amplification of a 459 base pair DNA product as determined by analysis of 
five microliters of the reaction sample on a ethidium bromide stained 0.7% TAE agarose gel. A 
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444 base pair product was purified using standard methods from a 1% TAE agarose gel after 
digestion of a sample of the 459 base pair amplification product with Ncol and EcoU restriction 
endonucleases. The 444 base pair product was ligated into- compatible sites in pMON7259 to 
generate pMON15028. Cell lysates prepared as above from IPTG induced cultures of JM101 
containing pMON15028 were analyzed for the presence of AMPA acyltransferase activity and 
compared to cultures containing pMON15027. The results were indistinguishable, thus 
confirming that phnO encoded an enzyme capable of AMPA acylation. In addition, this result 
indicated that the P2A mutation in the protein, which was introduced into the gene coding 
sequence as a result of thermal amplification using the AATPCR6 oligonucleotide primer (SEQ 
ID NO: 5), was without effect on the acyltransferase activity of the resulting PhnO protein when 
expressed in E. coli. 
Example 5 

This example illustrates the production of polyclonal antibodies directed to the PhnO 
peptide. 

Further studies of the phnO gene product required the use of antibodies directed to the 
PhnO protein. Therefore, PhnO was overproduced in E. coli JMlOlfor for use as an immunogen 
in stimulating the production of antibodies upon injection into a goat. Thep/mO gene containing' 
the P2A mutation in plasmid pMON15028 was introduced jnto plasmid pMON17061 on an Ncol 
to £coRI DNA fragment, producing pMON15032. phnO expression in pMON15032 is under 
the control of the E. coli reck promoter adjacent to the bacteriophage T7 gene 10L ribosome 
binding sequence. Cells were grown to mid log phase and induced by addition of nalidixic acid 
to the culture to approximately 50 parts per million, from a stock solution of 50 mg nalidixic 
acid powder dissolved in 1 ml 0.1 N NaOH. The culture was maintained under inducing 
conditions for twelve hours at 37°C. Cells were harvested as described in example 3, and 
sonicated in phosphate buffered saline. About 23% of the total soluble protein in the induced E. 
coli lysates was determined to be PhnO and approximately 60% of the total PhnO protein was 
released into the soluble phase as judged by SDS-PAGE and Coomassie blue staining. The 
protein was further purified by preparative SDS-PAGE providing a sufficient quantity of PhnO 
for use in producing antibody which binds to or reacts antigenically with PhnO or related AMPA 
transacylase proteins. Briefly, the PhnO protein was separated by size from other proteins in a 
15% SDS-PAGE gel. A gel slice containing the PhnO protein was excised, weighed, and 
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samples, indicating that the serum sample contains very little IgG which cross reacts with either 
E. coli or tobacco plant proteins when tested using this western blot method. 

An alternative source for r^rsrating antibody which is capable of specific binding to or 
reacting antigenically with PhnO protein was also utilized. A phnO gene was placed into a 
commercial vector (Invitrogen) containing a metal binding amino acid coding sequence (His6) 
' upstream of and in-frame with the phnO coding sequence. The H\s6-phnO DNA sequence was 
inserted into the E. coli expression vector pMON6235 on an Ncol to EcoKL fragment, under the 
control of an £. coli arabinose operon araBAD promoter, producing plasmid pMON32909. 
His6-PhnO protein was produced upon arabinose induction of £ coli W3110 cells containing 
pMON32909, and purified over a metal affinity column according to the manufacturers' 
instructions. 

His-tagged purified His6-PhnO protein standard was injected into 6 New Zealand White 
rabbits using an immunization procedure similar to that used for the goat, described above. 
Antiserum raised in these rabbits was also shown to be specific for binding PhnO protein and 
non-cross reactive with other E. coli bacterial or tobacco plant proteins. 
Example 6 

This example illustrates properties of an AMPA transacylase enzyme using 
aminomethylphosphonate and acetyl-CoA as substrates in ,an enzyme assay as measured by 
endpoint kinetic analysis. 

The apparent Km (Km) and Vmax (Vmax) of PhnO enzyme were determined for the 
substrates aminomethlyphosphonate and acetyl-CoA. Determination of the PhnO Km and 
Vmax were made by endpoint kinetic analyses, determining the enzyme velocity in consuming 
each substrate at varying substrate concentrations, and plotting the inverse of the enzyme 
velocity versus the inverse of the substrate concentration to produce a Lineweaver-Burk plot of 
enzyme kinetics. The conversion of [ l4 C]-AMPA to N-acetyl-[ 14 C]-AMPA was monitored as in 
example 2, using enzyme in a desalted crude lysate of E. coli expressing phnO from 
pMON 15032, produced as in example 4. Total protein per ml of extract was determined by the 
method of Bradford which indicated approximately 22.5 mg/ml. Densitometric scanning of 
Coomassie stained SDS-polyacrylamide gels resolving PhnO protein from these lysates indicated 
that PhnO represents about 23% of total protein, thus the cell extract was determined to contain 
about 5.2 mg PhnO protein per ml. In a first assay to determine the apparent Km and Vmax of 
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PhnO for AMP A, [ ,4 C]-AMPA concentrations ranged from 2 to 38 mM. Enzyme reactions were 
incubated at 37°C for 5 minutes and quenched with 1 volume of 100 mM sodium acetate 
(NaOAc), pH 4.4, in ethanol. Samples were analyzed by HPLC to determine the amount of 
[* 4 C]-AMPA converted to N-acetyl-[ l4 C]-AMPA. The assay conditions and output for each set 
of reactions are shown in Table 8. 
Table 8. 



PhnO Enzyme Kinetics for AMPA Substrate 



Sample 


S 1 


%Turnover~ 


Velocity* 


1/S 


1/V 


V/S 


1 


200 


39.5 


79 


1.0 


0.0127 


79.00 


2 


400 


35.1 


140 


0.5 


0.0071 


70.00 


3 


800 


32.9 


263 


0.25 


0.0038 


65.75 


4 


1200 


26.8 


322 


0.166 


0.0031 


53.67 


5 


1600 


26.2 


426 


0.125 


0.0023 


53.25 


6 


2000 


22.1 


442 


0.100 


0.0023 


44.20 


7 


2400 


19.2 


461 


0.083 


0.0022 


38.42 


8 


2800 


17.6 


493 


0.071 


0.0020 


35.21 


9 


3200 


17.3 


554 


0.063 


0.0018 


34.63 


10 


3600 


14.5 


522 


0.056 


0.0019 


29.00 


11 


4000 


13.6 


544 


0.050 


0.0018 


27.20 


12 


6000 


12.7 


762 


0.033 


0.0013 


25.15 


13 


7600 


10 


760 


0.026 


0.0013 


19.76 



1 - AMPA substrate concentration in reaction in nrri (nanomoles) 

2 - % turnover measured by the percent of N-acetyl-[ 14 C]-AMPA formed in relation to the 
amount of [ I4 C]-AMPA remaining in the sample 

3 - enzyme velocity in units of AMPA (nm) converted to N-acetyl-AMPA per minute per mg 
of protein 



A Linweaver-Burk plot of the 1/V vs 1/S data from Table 8 indicates that the apparent Km of 
PhnO for AMPA as a substrate is about 9 mM, and the apparent Vmax is about 824 U/mg 
protein. 

The apparent Km of PhnO for the substrate acetyl-CoA was determined in similar 
experiments. After several attempts to obtain end point kinetics, it was determined that the 
turnover number was too low to be reliable at AMPA concentrations of about 30 mM and 
enzyme amounts of about 1-10 ng. An alternative approach was tried using tritium labeled 
acetyl-CoA. The specific activity of the label was about 40 X higher than with [ 14 C], providing a 
gain in sensitivity that allowed for the determination of the apparent Km of PhnO for Acetyl- 
CoA. The [ 3 H]-acetyl-CoA (Amersham, Inc.) specific activity was 360 mCi/mg or 250 yCi/ml. 
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The transacylation mediated by PhnO from [ 3 H]-acetyl-CoA to [ 3 H]-acetyl-AMPA was 
monitored by weak anion exchange HPLC chromatography, with the retention times of acetyl- 
CoA and acetyl-AMPA adjusted so that these compounds were separated by about three minutes. 
This was accomplished by adjusting the concentration of KH 2 P0 4 buffer (pH 5.5) to 40 mM with 
a flow rate of 1 ml per minute over an AX100 weak anion exchange column. Each sampfe was 
reacted with PhnO and 30 mM AMPA for five minutes at 37°C and quenched with 100 mM 
NaOAc pH 4.4 in ethanol, then analyzed by HPLC. [ 3 H]-acetyl-CoA substrate ranged from 25 
micromolar to 1.3 mM in each reaction along with about 5ng PhnO, 50 mM Tris pH 7.5, 1 mM 
MnCl 2 , 1 mM MgCli, and 30 mM AMPA. Samples were analyzed by HPLC to determine the 
amounts of N-[ 3 H]-acetyl-AMPA produced, and [ 3 H]-acetyl-CoA remaining. The assay 
conditions and results for these reactions are shown in Table 9. 
Table 9. 



Sample No. 


[Acetyl- 
CoA] 1 


Velocity' 


i/[sr 


i/v* 


V/S' 


1 


25 


34 


0.0400 


0.0294 


1.3600 


2 J 


50 


66 


0.0200 


0.0152 


1.3200 


3 


. 75 


94 


0.0133 


0.0106 


1.2533 


4 


100 


125 


0.0100 


0.0080 


1.2500 


5 


125 


150 


0.0080 


0.0067 


1.2000 


6 


150 


173 


0.0066 ' 


0.0058 


1.1533 


7 


175 


193 


0.0057 


0.0052 


1.1029 


8 


200 


219 


0.0050 


0.0046 


1.0950 


9 


225 


240 


0.0044 


0.0042 


1.0667 


10 


250 


259 


0.0040 


0.0039 


1.0360 


11 


375 


339 


0.0027 


0.0030 


0.9040 


12 


390 


287 


0.0026 


0.0035 


0.7359 


13 


520 


331 


0.0019 


0.0030 


0.6365 


14 


650 


352 


0.0015 


0.0028 


0.5415 


15 


780 


372 


0.0013 


0.0027 


0.4769 


16 


910 


397 


0.0011 


0.0025 


0.4363 


17 


1040 


411 


0.0009 


0.0024 


0.3952 


18 


1170 


425 


0.0008 


0.0024 


0.3632 


19 


1300 


434 


0.0007 


0.0023 


0.3338 



2 - enzyme velocity as measured by amount of [ 3 H] incorporated into [ 3 H]-acetyl-AMPA per 
unit time 

3 - inverse substrate concentration 

4 - inverse velocity 

5 - ratio of velocity to substrate concentration _ — 



WO 00/29596 



PCT/US99/27152 



-76- 

A Linweaver-Burk plot of the 1/V vs 1/S data from Table 9 indicates that the apparent Km of 
PhnO for acetyl-CoA as a substrate is between 375-390 micromolar, and the apparent Vmax is 
about 824 U/mg protein. 

An approximate pH range of activity for the PhnO enzyme was determined using enzyme 
in a crude lysate of E coll expressing phnO from pMON 15032. The ability of the enzyrfie to 
. produce N-acetyl AMP A from a mixture containing acetyl-CoA and AMP A across a range of pH 
values was determined. The reactions were carried out in MES/MOPS/Tricine buffer 
equilibrated to a pH value from 4.5 to 9.0, with actual pH values ranging from 5.2 through 9.0. 
Briefly, 95 microliters of an appropriate buffer was mixed with 100 microliters of 2X assay mix 
as described in example 4, and 5 microliters of desalted E. coli lysate containing approximately 
400 ng/microliter PhnO protein. The reaction was incubated at 37°C for five minutes and 
quenched with 100 mM NaOAc pH 4.4 in ethanol, and analyzed by HPLC as described in 



example 4. The results are shown in Table 10. 
Table 10. 

PhnO Enzyme pH Profile 



Buffer pH 


'Mock Reaction 
pH 


Turnover 


N-Acetyl CoA 
(nmole) 


^Velocity 
(nmole/min/microgram) 


5.0 


5.23 


3.7 


222 


22.2 


5.5 


5.62 


3.9 


• .234 


23.4 


6.0 


5.92 


4.2 


252, 


25.2 


6.5 


6.47 


13.3 


798 


79.8 


7.0 


7.0 


. 27.0 


1620 


162.0 


7.5 


7.48 


32.0 


1920 


192.0 


8.0 


8.05 


34.3 


2058 


205.8 


8.5 


8.46 


33.5 


2010 


201.0 


9.0 


9.0 


33.9 


2034 


203.4 



1- indicates true pH value after combining all reagents for each initial buffer pH value given 

2- determined as in Table 9 for Km and Vmax 

3- determined as in Table 9 for Vmax 



The results indicate that optimum PhnO transacylase activity using AMPA and acetyl- 
CoA as substrates is about pH 8.0. However PhnO efficiently converts AMPA to N-acetyl- 
AMPA using acetyl-CoA as the acetyl donor across a pH range from about 6.5 to at least 9.0. 

Additional experiments were carried out with purified PhnO protein to further 
characterize the scope of the enzyme's substrate preference for acyl-CoA acyl donor compounds. 
It has been established herein that at least one substrate acyl- donor or leaving group can be a 
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two carbon acid compound such as the acetyl- moiety in the compound Acetyl-CoA. It was not 
known what range of acyl- molecules comprised of different carbon chain lengths would or 
could function as a leaving group from the acyl-CoA acyl donor when reacted with PhnO 
transacylase and AMPA as the acyl- receptor molecule. Therefor, an HPLC assay similar to that 
described in Example 2 was developed to determine the scope of the enzymes' ability to t/ansfer 
an acyl- group from an acyl-CoA acyl donor to [ 14 C]-AMPA. 

PhnO was purified from a one liter Luria Bertani broth culture of E. coli JM101 
expressing a recombinant phnO gene from pMON15032 after nalidixic acid induction for three 
hours at 37°C. Cells were harvested by centrifugation and resuspended in 40 ml cold Tris buffer 
(0.1 M Tris-HCl pH 8) and placed on ice. The cell suspension was brought to 1 mM DTT and 
0.5 mM PMSF. The suspension was lysed by 2 passages through a prechilled French pressure • 
cell at 1,100 psi, centrifuged at 12,000 g (10,000 rpm in an Sorvall SA600 rotor) for 40 min at 
4°C, then placed on ice. The cleared supematants were poured into fresh 15 ml polypropylene 
tubes. The samples were split again into two equal portions and maintained at -80°C until used 
further for purification of PhnO protein. 20 microliters of the soluble fraction was assayed for 
enzyme activity using the HPLC method described above in Example 2, except after terminate^ 
the assay with acid addition, the sample was stored at -80°C. A Sephacryl S200 column was- 
prepared according to the manufacturers' instructions and equilibrated with a solution containing 
20 mM Tris pH 8.0 and 0.5 mM MgCl 2 . The entire total soluble extract was layered over the top 
of the column bed after thawing on ice. Forty 9 ml fractions were collected from the column 
eluate, and thirty microliters of each fraction was analyzed by western blot using anti-PhnO 
antiserum after resolution on a 15% SDS-PAGE gel. Also, thirty microliters of each fraction 
was analyzed for AMPA acyl transferase activity using the method described in Example 2. 
Samples which exhibited acyl transferase activity and which corresponded to positive western 
blot data were pooled. These were represented by fractions 7 through 19 in this example, and 
were combined into a 100 ml volume, distributed into ten 10 tubes each containing 10 ml 
volumes, and stored at -80°C for further use. 

Anion exchange chromatography was used to determine the elution pattern of PhnO away 
from other contaminating proteins that co-elute during the Sephacryl S200 fractionation. One 
tube from the combined PhnO positive fractions was thawed on ice and injected into a 5/5 Mono- 
Q column pre-equilibrated with buffers A (one liter of 20 mM Tris-HCl pH 8.0 Mili-Q distilled 
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deionized water) and B (one liter of 20 mM Tris-HCl pH 8.0, 1 M NaCl). The sample containing 
PhnO active^rotein was injected into the column and one milliliter fractions were collected. The 
column was washed for five minutes with a flow rate of 1 .8 ml per minute Buffer A after loading 
the PhnO containing sample. At five minutes, Buffer B was added to the flow volume at 0.5 ml 
per minute for four minutes. Buffer B was ramped up to 22% of the flow volume at 10 mimites, 
30% at 12 minutes, 36% at 13 minutes, 41% at 14 minutes, 46% at 15 minutes, 74% at 16 
minutes, and 100% at 16 minutes through 22 minutes, at which point Buffer B flow was 
terminated and Buffer A was reinitiated at 100% to equilibrate the column. Ten microliter 
volumes from individual fractions collected from the Mono-Q column were analyzed by western 
blot and for transacylase activity as described in Example 2. Fractions which exhibited positive 
AMPA acyltransferase activity and which correlated with the Western blot data were pooled and 
maintained as a purified protein sample. Samples of this purified PhnO protein were used to 
determine enzyme's acyl donor substrate specificity. 

Enzyme reactions were prepared as follows. 100 microliter reactions consisted of 50 mM 
Tris-HCl pH 8.0, 1 mM MgCl 2 , 3 microliters of 1.3 mM [ ,4 C]-AMPA (115,392 dpm per 
microliter), 0.1 mM or 1 mM acyl-CoA acyl donor, and 2.5 microliter purified enzyme sample. 
A assay premix was' prepared from which 45 microliters was used in each 100 microliter 
reaction. This 45 microliter premix sample consisted of 40 microliters distilled and deionized 
water, 2 microliters of 50 mM MgCl 2 , and 3 microliters of l'.3 mM [ l4 C]-AMPA (115,392 dpm 
per microliter). Reactions were initiated by mixing 40 microliters of 125 mM Tris-HCl pH 8.0, 
2.5 microliters protein sample and 10 microliters acyl-CoA acyl donor compound in a 
microcentrifuge tube at room temperature. Each acyl-CoA acyl donor compound was prepared 
as a stock solution of ImM, 5 mM or 10 mM stocks; Each tube was then mixed with 45 
microliters of the assay premix containing the [ ,4 C]-AMPA receptor substrate, mixed gently and 
transferred to a 30°C water bath for 5 minutes. Each reaction was terminated with the addition 
of 4 microliters of 1M HC1 S mixed by vortexing, and placed on ice or stored at -20°C until 
assayed for the presence of [ 14 C]-AMPA or related compounds by HPLC. 

HPLC analysis was carried out using a Waters 510 dual pump HPLC system with a 481 
wavelength max UV detector and a scintillation pump, a Phenomenex PHENOSPHERE 5 
micrometer 80A SAX-silica HPLC column (250X4.6 mm, 3500 PSI max pressure), Buffer A 
consisting of 5 mM KH 2 P0 4 , 4% methanol, adjusted to pH 2.0 with H 3 P0 4 , and Buffer B 
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consisting of 200 mM KH 2 P0 4 , 4% methanol adjusted to pH 2.0 with H 3 P0 4 , and HAZARD 
Atomflow (Packard) containing 64% 1,2,4 trimethylbenzene, 7.5% sodium-dicotyl 
sulfosuccinate, 3.5% sodium diamylsulfosuccinate, and 6% polyoxyethylene(4)lauryl ether. 
HPLC gradient conditions for each sample analysis were similar to those described in Example 
2, with minor variations. The flow rates are provided in Table 11. 
Table 11. 



HPLC Gradient Conditions 



Time (min) 


Flow 


%A 


%B 


Flow Rate' 




(ml/min) 








0.0 


1 


100 


0 


3 


2.0 


1 


100 


0 


3 


5.0 


1 


50 


50 


3 


15.0 


1 


0 


100 


3 


17.0 


1 


0 


100 


3 


17.3 


1 


100 


0 


3 


21.0 


1 


100 


0 


j 


21.3 


0.1 


100 


0 


0 



1- Scintillation fluid flow rate in milliliters per minute 



Stock solutions of Acyl-CoA acyl donor compounds were prepared as described above, 
and these are listed here: Na Acetyl-CoA, Li n-propionyl-CoA, Li giutaryl-CoA, Li. 
methylmalonyl CoA, Li crotonoyl-CoA, Li isobutyryl-CoA,* Na succinyl-CoA, Li tiglyl-CoA, Li 
n-valeryl-CoA, and Li desulfo-CoA. All compounds were obtained from Sigma Chemical 
Company, St. Louis, MO. The percent activity of the purified enzyme for transfer of the CoA 
associated acyl- moiety to [ l4 C]-AMPA was determined by measuring the percentage of [ 14 C]- 
AMPA HPLC chromatogram peak area converted to some other [ I4 C]-compound, such as N- 
acetyl-[ ,4 C]-AMPA, with the amount of N-acetyl-[ l4 C]-AMPA produced during the reaction in 
which [ 14 C]-AMPA and 1 mM acetyl-CoA are substrates for PhnO being established as the 
100% reference. The results are shown in Table 12. 
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Table 12. 



AMPA Transacylase Enzyme Efficiency for Acyl-CoA Acyl Donor Substrate 



Acyl-CoA Acyl Donor 



TxpAMPA % Conversion' 



% Activity 



Acetyl-CoA 0.1 mM 
Acety-CoA 0.5 mM 
Acety-CoA 1 mM 
Propionyl-CoA 0.1 mM 
Propionyl-CoA 0.5 mM 
Propionyl-CoA 1'mM 
Glutaryl-CoA 0.1 mM 
Glutaryl-CoA 0.5 mM 
Glutaryl-CoA 1 mM 
Methylmalonyl-CoAO.l mM 
Methylmalonyl-CoA 0.5 mM 
Methylmalonyl-CoA 1 mM 
Crotonoyl-CoAO.l mM 
Crotonoyl-CoA 0.5 mM 
Crotonoyl-CoA 1 mM 
Isobutyryl-CoA 0.1 mM 
Isobutyryl-CoA 0.5 mM 
Isobutyryl-CoA 1 mM 
Succinyl-CoA 0.1 mM 
Succinyl-CoA 0.5 mM 
Succinyl-CoA 1 mM 
Tiglyl-CoAO.l mM 
Tiglyl-CoA 0.5 mM 
Tiglyl-CoA 1 mM 
Valeryl-CoAO.l mM 
Valeryl-CoA 0.5 mM 
Valeryl-CoA 1 mM 
Table 12. (continued) 



792 
98.7 

100.00 
78.2 
97.8 

100.00 
0.81 
0.00 
0.57 
1.11 
2.08 
2.21 
0.80 
0.00 
0.00 
2.10 
0.20 
0.00 
5.06 
3.38 
1.56 
0.00 
0.00 
0.99 
0.24 
0.00 
0.33 



Acyl-CoA Acyl Donor 



["q-AMPA % Conversion 7 



79.2 
98.7 

100.00 
78.2 
97.8 

100.00 
0.81 
0.00 
0.57 
1.11 
2.08 
2.21 
0.80 
0.00 
0.00 
2.10 
0.20 
0.00 
5.06 
3.38 
1.56 
0.00 
0.00 
0.99 
0.24 
0.00 
0.33 



% Activity 



0.95 
1.25 
0.52 



0.95 
1.25 
0.52 



Desulfo-CoAO.l mM 
Desulfo-CoA 0.5 mM 

Desulfo-CoA 1 mM ,_ 

1 - percentage of ["*C]-AMPA HPLC chromatogram peak area converted to some other 
[ l4 C]-compound, such as N-acetyl-[ 14 C]-AMPA, with the amount of N-acetyl-[ ,4 C]- 
AMPA produced during the reaction in which [ l4 C]-AMPA and 1 mM acetyl-CoA are 
substrates for PhnO being established as the 100% reference 



These results indicate that PhnO enzyme is capable of efficiently utilizing acyl-CoA 
associated compounds which have an acyl group with a carbon chain length of not more than 
three for transacylating AMPA. Other compounds which have a longer carbon chain length than 
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propionyl- and which are not broad or bulky, such as methylmalonly, isobutyryl-, and succinyl 
-CoA compounds are also effective acyl-CoA acyl donors, but at a lower enzyme efficiency. 
Example 7 

This example illustrates the in vitro expression and targeting of an AMPA acy [transferase 

" f 

protein into isolated chloroplasts. 

Many chloropiast-localized proteins are expressed from nuclear genes as precursors and 
are targeted to the chloroplast by a chloroplast transit peptide (CTP). The CTP is removed 
during steps involved in import of the targeted protein into the chloroplast. Examples of such 
chloroplast proteins include the small subunit (SSU) of ribulose-i,5-bisphosphate carboxylase 
(RUBISCO), 5-enol-pyruvylshikimate-3-phosphate (EPSPS), ferredoxin, ferredoxin 
oxidoreductase, the light-harvesting-complex protein I and protein II, and thioredoxin F. It has 
been demonstrated in vivo and in vitro that non-chloroplast proteins may be targeted to the 
chloroplast by use of ftisions with a CTP and that a CTP sequence is sufficient to target a protein 
to the chloroplast (Della-Cioppa et al. ? 1987). 5-enolpyruvylshikimate-3-phosphate synthetase 
(EPSPS) enzyme is located in the chloroplast and is the glyphosate target in plants. Targeting 
glyphosate oxidoreductase to the chloroplast has been found to provide tolerance to plants to 
glyphosate, although GOX localized to the cytoplasm is also able to provide such tolerance. 
Generally, recombinant GOX enzyme is localized to the chloroplast. GOX mediated glyphosate 
metabolism produces AMPA, which has been shown to be phytotoxic. It has been shown herein 
that PhnO is capable of AMPA N-acylation and that N-acetyl-AMPA is not phytotoxic. 
Therefore, it may be necessary to inactivate AMPA in plants. This assumes that AMPA 
acyltransferase can be expressed in plants as an active enzyme, and that such acyltransferases are 
capable of being imported into the chloroplast and retain enzymatic activity. In view of the 
AMPA phytotoxicity as described in example 1, an AMPA acyltransferase gene was introduced 
into plant expression vectors to test expression in plants. In addition, import of acyltransferase 
into chloroplasts was also tested. 

A DNA sequence encoding a chloroplast targeting peptide was linked 5' to and in frame 
with a DNA sequence encoding an AMPA acyltransferase. A DNA sequence encoding an 
arabidopsis ribulose-1 -bis-phosphate carboxylase small subunit chloroplast transit peptide (CTP, 
SEQ ID NO:9) was excised from pMON 17058 using Bglll and Ncol restriction endonucleases, 
and inserted into complementary restriction sites in pMON15028 to produce pMON15029, so 
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that the CTP coding sequence was linked 5' to and in frame with the phnO coding sequence in 
pMON1502$. The resulting chimeric phnO gene in pMON 15029 is capable of producing a 
chloroplast targeted PhnO protein. An EcoRl to Bglll DNA cassette containing the CTP-PhnO 
coding sequence, SEQ ID NO:ll, from pMON 15029 was inserted into EcoRl and BamHl sites 
in pBlueScript KS(-) to produce pMON 15036. The CTP-PhnO coding sequence in pMON 15036 
can be expressed in an in vitro transcription/translation system from a phage T3 promoter. A 
similar plant transient expression plasmid, pMON15035, was constructed, but without the 
chloroplast targeting sequence. An EcoRl to Bglll DNA fragment containing only the phnO 
coding sequence was excised from pMON 15028 and inserted into EcoRl and BamHl sites in 
pBlueScript KS(+) so that PhnO could be produced from a phage T7 promoter in an in vitro 
transcription/translation system. An Nco\ to EcoRl DNA sequence encoding PhnO was excised 
from pMON15028 and inserted into pMON 17061, producing pMON15032. pMON15032 
provides for expression of phnO from an £. coli recA promoter. A Bglll to EcoRl DNA 
fragment encoding PhnO was excised from pMON 15028 and inserted into pBlueScript SK(-) to 
produce pMON 15033. pMON 15033 provides for expression of phnO from an E. coli lac 
promoter. A Bglll to EcoRl DNA fragment encoding CTP-PhnO was excised from 
pMON15029 and insetted into compatible sites in pBlueScript SK(-), providing for expression of 
chloroplast targeted PhnO protein from an E. coli lac promoter from pMON 15034. 

pMON15032, pMON15033, and pMON15034 were introduced into E. coli JM101. 
Cultures were grown and induced as described above, except that expression from cells 
containing pMON 15032 was induced with addition of 50 parts per million nalidixic acid in 0.1 
M NaOH. Cleared lysates were prepared from each culture and subjected to an AMPA 
acyltransferase assay as described above in order to determine the presence of AMPA 
acyltransferase activity. All lysates contained substantial amounts of acyltransferase activity 
above control levels. More importantly, the CTP-PhnO peptide (SEQ ID NO: 12) expressed from 
pMON 15034 appeared to retain full enzymatic acyltransferase activity. 

pMON15035 (PhnO) and pMON15036 (CTP-PhnO) were used in vitro to generate [ 35 S]- 
methionine labeled PhnO protein for use in a chloroplast import assay. Briefly, the procedure 
used for in vitro transcription and translation was as described in Short Protocols In Molecular 
Biology, Third Edition, Ed. Ausubel et al., Wiley & Sons Pub., (1995), which is herein 
incorporated by reference. About 20 micrograms of plasmid DNA was digested to completion 
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with Hindlll restriction endonuclease in a 100 microliter reaction. 20 microliters of the plasrhid 
digest, or about 4 micrograms of linearized plasmid DNA, was used in an in vitro transcription 
reaction to generate mRNA for producing PhnO or CTP-PhnO -protein product in later translation 
reactions. Transcription reactions consisted of 20 microliters -of linearized plasmid DNA, 20 
microliters of a 5X transcription buffer (200 mM TrisHCl pH 8.0, 40 mM MgCl 2 , lO'mM 
•spermidine and 250 mM NaCl), 20 microliters of 5X ribonucleoside triphosphate mix (5mm 
each ATP, CTP, UTP, 5 mM diguanosine triphosphate (G-5'ppp5'-G)TP, 5 mM GTP), 10 
microliters 0.1 M dithiothreitol (DTT), 10 microliters RNasin™ (a pancreatic ribonuclease 
inhibitor mixture from Promega), 4 microliters RNA polymerase (T7 or T3, New England 
Biolabs, Inc.), and distilled, deionized water to 100 microliters. Each reaction was incubated at 
37°C for one hour. 4.5 microliters of each reaction was analyzed on a 1.4% agarose 
formaldehyde gel to ensure that each reaction produced adequate RNA template for the 
following translation step. 

20 microliters of the transcription reactions were used for producing [ 35 S]-methionine 
labeled PhnO proteins for use in a chloroplast import assay. Briefly, RNA was mixed with 6 
microliters of an aqueous amino acid mixture without methionine, 15 microliters of [ 35 S]- 
methionine (1400 Ci/mmol, Amersham), and 200 microliters of a rabbit reticulocyte lysate. 
These reactions were incubated at 37°C for two hours and placed on dry ice for storage. A 10 
microliter sample of each reaction was analyzed on a 15% SDS-PAGE gel. Gels were vacuum 
dried and placed directly onto the emulsion side of KODAK ™ X-O-MAT ™ film for 
autoradiography. The results indicated that each plasmid produced respective peptides of 
predicted molecular mass for PhnO (pMON15035) and CTP-PhnO (pMON15036) in sufficient 
quantity to test for uptake into chloroplasts in an import assay. 

Intact chloroplasts were isolated from one head of deveined Romaine lettuce according to 
Edelman et al., Methods in Chloroplast Molecular Biology, Elsevier Biomedical Press, Chap. 86, 
1982. One liter of grinding buffer (GR-buffer) stock was prepared (2 mM NaEDTA, 1 mM 
MgC12, 1 mM MnC12, 50 mM Hepes-KOH pH 7.5, and 0.33 mM sorbitol). Immediately before 
use, 890 mg of ascorbic acid was added to 900 ml of GR-buffer stock solution. One head of 
torn, deveined Romaine lettuce was mixed with 900 ml GR-buffer and emacerated by mixing in 
a Waring blender three times for three seconds each time at high speed. The slurry was filtered 
through four layers of Miracloth, and the filtrate was centrifuged at 5,000 RPM for 10 minutes at 
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4°C in a SORVALL™ GS-3 rotor. The supernatant was decanted and the pellet resuspended 
with a glass rod in 4 milliliters of GR-buffer. Chloroplasts were isolated by centrifiigation 
through a Percoli gradient. 80% Percoll was prepared by mixing 16 mis of PBF-Percoll with 4 
mis of 5X Buffer (10 mM EDTA, 5 mM MgCl 2 , 5 mM MnCl 2 , 250 mM Hepes-KOH, 30 grams 
sorbitol, 490 mg NaAscorbate, 85.5 mg glutathione to 100 mis with ddH 2 0)r A 40% Ptfcoll 
.solution was prepared by combining 8 mis PBF-Percoll with 4 mis 5X Buffer and 8 mis of 
ddH 2 0. A Percoll gradient was prepared in a 30 ml Corex tube by layering 10 mis of 40% 
Percoll onto 10 mis of 80% Percoll. Chloroplasts were isolated by layering the resuspended 
chloroplasts onto the percoll gradient, spinning at 9 ? 500 RPM for ten minutes in an SS-34 
SORVALL™ swinging bucket rotor at 4°C for ten minutes with the brake on. Broken 
chloroplasts remain in the upper layer and were pipetted off. The intact chloroplasts were 
located at the interface of the 40/80% Percoll gradient and were removed to a new 30 ml 
COREX™ tube. The isolated chloroplasts were washed two times with GR-buffer and 
centrifiiged for collection after each wash in a SS-34 rotor at 6,000 RPM for ten minutes at 4°C 
with the brake off. Isolated, washed chloroplasts were resuspended in 1 ml sterile 50 mM 
Hepes-KOH pH 7.7, 330 mM sorbitol by gently stirring with a glass rod, and the chlorophyll 
concentration of the slurry was determined. 5 mis of an 80% acetone solution was added to 20 
microliters of the chloroplast slurry and vortexed gently. The resulting mixture was filtered 
through a Whatman™ #1 filter paper into a culture tube. The absorbance of the filtrate was 
determined at 645nm and 663 nm against an 80% acetone blank. The chlorophyll concentration 
in micrograms per ml was determined according to equation #1 as [chlorophyll fig/ml] = [A 6 45+ 
[A 6 63 * (8.02)]. The mass of the chlorophyll in ng is calculated by taking the amount of 
chlorophyll measured in jig/ml and multiplying by the volume into which the chloroplasts were 
resuspended (equation #2), which is 5 mis in this example. Thus, the concentration of 
chlorophyll in \ig/\xl in the measured sample is equivalent to the value determined in equation #2 
divided by the volume of the sample measured, which in this example is 20 \xl. In this example, 
A645 was determined to be 0.496, and A 6 63 was determined to be 1.0814. Thus, the concentration 
of chlorophyll in the measured sample was 4.67 fig/fil. The concentration of chlorophyll in the 
chloroplast slurry was adjusted to 4.0 pg/^l with Hepes-KOH pH 7.7, 330 mM sorbitol solution 
and the resulting chloroplast suspension was stored on ice in the dark. 
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A typical 300 microliter uptake experiment contained 5 mM ATP, 8.3 mM unlabeled 
methionine, 322 mM sorbitol, 58.3 mM Hepes-KOH (pH 8.0), 50 microliters reticulocyte lysate 
translation products, and intact chloroplasts (about 200 microgram chlorophyll). The uptake 
mixtures were gently rocked at room temperature in 10X75 mm glass tubes, directly in front of a 
fiber optic illuminator set at maximum light intensity using a 150 Watt bulb: Two separate 70 
• microliter samples_of each uptake mix were removed at 0, 5, 10 and 15 minutes. One sample 
was centrifuged over 100 microliter silicone-oil gradients in 150 microliter polyethylene tubes by 
centrifugation at 1 1,000 X g for 30 seconds, and immediately frozen in dry ice. Under these 
conditions, the intact chloroplasts form a pellet under the silicone-oil layer and the incubation 
medium containing the reticulocyte lysate remains floating on the surface of the interface. The 
other sample was treated with protease (one tenth volume or 7 microliters of 0.25 mg/ml eachj, 
trypsin and chymotrypsin protease mixture) for thirty minutes on ice, then subjected to silicone- 
oil separation and frozen on dry ice. The chloroplast pellets were then resuspended in 50-100 
microliters of a lysis buffer (10 mM Hepes-KOH pH 7.5, 1 mM PMSF, 1 mM benzamidine, 5 
mM e-amino-n-caproic acid, and 30 micrograms per ml aprotinin) and centrifuged at 15,000 X g 
for 20 minutes to pellet the thylakoid membranes. The cleared supernatant (stromal proteins) 
from this spin, and an aliquot of the reticulocyte lysate incubation medium from each uptake., 
experiment, were mixed with an equal volume of 2X SDS-PAGE sample buffer and analyzed on 
a 15% SDS-PAGE gel, dried, and exposed to film as described above. Chloroplasts exposed to 
[ 35 S]-methionine labeled CTP-PhnO contained [ 35 S]-labeled protein of a size consistent with the 
predicted CTP- processed form of PhnO, while chloroplasts exposed to methionine labeled PhnO 
were devoid of labeled protein. Labeled protein imported into the chloroplasts was also protease 
resistant. These results indicated that PhnO could be targeted to chloroplasts when fused to a 
plastid targeting peptide sequence. 
Example 8 

This example illustrates the identification and characterization of plants transformed with 
an AMPA acyltransferase. 

A wide variety of plant species have been successfully transformed using any number of 
plant transformation methodologies well known in the art. In particular, Agrobacterium 
tumefaciens mediated plant transformation is the preferred method presently in use, however, 
ballistic methods which increase delivery of naked DNA directly to plant cells through 
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microprojectiie bombardment are also very effective in producing recombinantly transformed 
plants. In addition, methods which involve the use of liposomes, electroporation, chemicals that 
increase free DNA uptake, and transformation using viruses or pollen are alternatives which can 
be used to insert DNA constructs of this invention into plant cells. Plants which can be 
transformed by the practice of the present invention include but are not limited to corn, v^ieat, 
cotton, rice, soybean, sugarbeet, canola, flax, barley, oilseed rape, sunflower, potato, tobacco, 
tomato, alfalfa, lettuce, apple, poplar, pine, eucalyptus, acacia, poplar, sweetgum, radiata pine, 
loblolly pine, spruce, teak, alfalfa, clovers and other forage crops, turf grasses, oilpalm, 
sugarcane, banana, coffee, tea, cacao, apples, walnuts, almonds, grapes, peanuts, pulses, petunia, 
marigolds, vinca, begonias, geraniums, pansy, impatiens, oats, sorghum, and millet. DNA 
molecules for use in the present invention can be native or naturally occurring genes or chimeric 
genes constructed from useful polynucleotide sequences including promoters, enhancers, 
translated or non-translated leaders, sequences encoding signal peptides, sequences encoding 
transit peptides, structural genes, fusions of structural genes, terminators, introns, inverted 
repeats or direct repeats, linkers, and polyadenylation sequences. DNA sequences contemplated 
in this invention include single and double stranded polynucleotide sequences, linear sequences, 
and covalently closed circular polynucleotide sequences, plasmids, bacmids, cosmids, bacterial 
artificial chromosomes (BAC's), yeast artificial chromosomes (YAC's), and viral DNA and RNA 
sequences. In consideration of Agrobacterium mediated plant transformation, suitable plant 
transformation vectors include those derived from a Ti plasmid of Agrobacterium tumefaciens, 
as well as those disclosed, for example by Herrera-Estrella (1983), Bevan (1984), Klee (1985) 
and EPO publication 120,516 (Schilperoort et al.). In addition to plant transformation vectors 
derived from the Ti or root-inducing (Ri) plasmids of Agrobacterium, alternative methods as 
described above can be used to insert the DNA constructs of this invention into plant cells. 

Plasmids used for plant transformation generally were constructed from vectors which 
have been described elsewhere, particularly in US Pat No. 5,463,175 (Barry et al., 1995), which 
is herein incorporated by reference. Plasmids were constructed and maintained in £. colt using 
Tnl aminoglycoside adenylyltransferase resistance {aad gene, commonly referred to as 
streptomycin/spectinomycin or Spc/Str resistance), which is also a determinant for selection and 
maintenance in Agrobacterium. Other plasmid maintenance and selectable markers well known 
in the art for use in E. coli were also used, consisting essentially of neomycin 



WO 00/29596 



PCT/US99/27152 



-87- 

phosphotransferase, gentamycin acetyltransferase, and beta lactamase genes alone or present in 
combination on a single replicon or vector. Plasmids generally contain oriV, a replication origin 
derived from the broad host range plasmid RK2, and or/322 and bom (origin of replication for 
maintenance in £ coli, and basis of mobility for conjugational transfer respectively) sequences 
derived from plasmid pBR322. ' 

A phnO gene encoding an AMPA acyltransferase was inserted into expression cassettes 
in plant transformation vectors. These cassettes generally contain the following elements in 
sequential 5' to 3' order: a sequence comprising a plant operable promoter, a sequence encoding a 
chloroplast or plastid transit peptide, a cloning site or sites contained within a polylinker, and a 
plant functional 3' nontranslated region. Expression cassettes often are constructed to contain 
unique restriction sites flanking the cassette domain so that the entire cassette can be excised" 
from one plasmid and placed into other similarly constructed plasmid vectors. Restriction sites 
comprised of eight base pair recognition sequences are preferred, and most cassettes in the 
present invention are flanked at least on one end by a Notl restriction endonuclease recognition 
site. Preferred promoters are the figwort mosaic virus promoter, P-FMV (Gowda et aL, 1989), 
the cauliflower mosaic virus 35S promoter CaMV 35S ( Odell et al., 1985), or the enhanced 
CaMV 35S promoter (US Pat. No. 5,196,525; Kay et aL, 1987). A number of other promoters ■ 
which are active in plant cells have been described in the literature. Such promoters may be 
obtained from plants or plant viruses and include, but are not limited to the nopaline synthase 
(NOS) and octopine synthase (OCS) promoters which are carried on tumor-inducing plasmids 
generally found within virulent and non-virulent strains of Agrobactehum iumefaciens, the 
cauliflower mosaic virus (CaMV) 19S promoter, the comalina yellow mottle virus promoter, the 
sugar cane bacilliform DNA virus promoter, the peanut chlorotic streak virus promoter, the rice 
actin promoter, and the light-inducible ribulose 1,5-bisphosphate carboxylase small subunit 
promoter (ssRUBISCO). These promoters can used to create various types of DNA constructs 
useful for gene expression in plants (see for example Barry et al. US Patent No. 5,463,175). 
Particularly desirable promoters which are contemplated because of their constitutive nature are 
the Cauliflower Mosaic Virus 35S (CaMV35S) and the Figwort Mosaic Virus 35S (FMV35S) 
promoters which have previously been shown to produce high levels of expression in most plant 
organs. Other promoters which would direct tissue specific or targeted expression are also 
contemplated, for example in tissue such as leaves, meristem, flower, fruit and organs of 
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reproducuve character. In addition, chimeric promoters are also env isioned. Nopaline synthase 
gene (NOS 3') and the pea ribulose bisphosphate carboxylase synthase E9 gene (E9 3') 3' 
nontranslated termination and polyadenyiation sequences were also used. 

Expression cassettes consisting of a AMPA acyltransferase structural gene inserted 

5 downstream of a promoter and between a sequence encoding a chloroplast targeting peptide and 
a 3 ? nontranslated sequence were generally present on a plant transformation vector. Expression 
cassettes were generally flanked on either end of the cassette by a nopaline type T-DNA right 
border region on one end and a left border region on the other end, both border regions derived 
from pTiT37 (Fraley et aL 1985). Some plant transformation vectors only contained the right 

io border region, required for initiation of T-DNA transfer from Agrobacterium to the host cell. 
Most plant transformation vectors also contained a GOX (glyphosate oxidoreductase) gene, as 
described above, and in US Patent No. 5,463,175. GOX enzyme expressed from these vectors 
was generally targeted to the chloroplast when inserted into the plant genome. 

Plant transformation vectors were mobilized into the ABI Agrobacterium strain A208 

is carrying the disarmed Ti plasmid pTiC58 (pMP90RK)(Koncz and Schell, 1986). The Ti plasmid 
does not carry the T-DNA phytohormone genes which induce crown gall formation. Mating of 
the plant vector into* ABI was done by the triparental conjugation system using the helper 
plasmid pRK2013 (Ditta et al., 1980). Alternatively, tfae plant transformation plasmid can be 
introduced into the ABI strain by electroporation as described by Mattanovich et al. {Efficient 

20 transformation of Agrobacterium spp. by electroporation., Nucleic Acids Res. (1989), 17(16), 
6747), which is herein incorporated by reference. When plant tissue is incubated with the 
ABI::plant vector conjugate, the recombinant vector is transferred to the plant cells by the vir 
functions encoded by the disarmed pTiC58 plasmid. Ideally, the recombinant vector opens at the 
T-DNA right border region, and the DNA between the right and left border sequences is 

25 transferred directionally and inserted into the host plant genome, although the entire recombinant 
plant transformation vector sequence may be transferred and inserted. The pTiC58 Ti plasmid 
does not transfer to the plant cells but remains in the Agrobacterium donor. 

Recombinant plants can be regenerated from plant cells or plant tissue which has been 
transformed with a functional AMPA acyltransferase structural gene. The choice of 

30 methodology for the regeneration step is nc . critical, with suitable proto ols being available for 
hosts from Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae (carrot, celery, parsnip), 
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Cruciferae (cabbage, radish, rapeseed. etc.), Cucurbitaceae (melons and cucumber), Gramineae 
(wheat, rice, com, etc.), Solanaceae (potato, tobacco, tomato, peppers), and various floral crops. 
See for example, Ammirato, 1984; Shimamoto, 1989; Fromm, 1990; and Vasil, 1990). 
Recombinant plants which have been transformed with an AMPA acyltransferase can also be 
selected on medium containing AMPA. The appropriate inhibitory concentration of AMPA. can 
readily be determined by one of ordinary skill in the art for any particular host by screening for 
AMPA toxicity as described in example 1. Alternatively, when AMPA acyltransferase is 
transformed into plants previously transformed with GOX and selected for growth on 
glyphosate. either AMPA or glyphosate can be used as the selective ingredient for selecting for 
transformation events which express sufficient levels of AMPA acyltransferase enzyme. 
Glyphosate must be applied at levels which would otherwise be inhibitory to a recombinant plant' 
expressing GOX and selected for growth on glyphosate, due to the increased level of AMPA 
which may be produced as a result of GOX mediated glyphosate degradation. In plants which 
express recombinant GOX enzyme, exposure to increasing levels of glyphosate has been shown 
to induce yellowing or chlorosis of the leaves, stunted growth characteristics, and infertility. 
AMPA acyltransferase expressed coordinately or in combination with GOX expression can 
overcome these detrimental effects. It is also possible to use AMPA as a plant transformation 
selectable marker as an alternative to glyphosate selection. ( 
Tobacco 

Tobacco plants were transformed with a phnO gene. A tobacco leaf disc transformation 
procedure employed healthy tissue from a leaf of about one month old. After a 15-20 minute 
surface sterilization with 10% CLOROX™ plus a surfactant, leaves were rinsed three times in 
sterile water. Leaf discs were punched with a sterile paper punch, and placed upside down on 
MS 104 media (4.3 g/1 MS salts, 30 g/1 sucrose, 2 ml/1 500X B5 vitamins, 0.1 mg/1 NAA, and 1.0 
mg/1 BA), and pre-cultured for one day. Discs were then inoculated with an 1:5 diluted 
overnight culture of disarmed Agrobacierium ABI containing the subject vector (final culture 
density about 0.6 OD as determined at 550 nm). The inoculation was done by placing the discs 
in sterile centrifuge tubes along with the culture. After thirty to sixty seconds, the liquid was 
drained off and the discs were blotted between sterile filter paper. The discs were then placed 
upside down on a filter disc on MS 104 feeder plates and incubated for 2-3 days. After this co- 
culture period, the discs were transferred, still upside down, to selection plates containing MS 104 
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media. After 2-3 weeks, callus formed, and individual clumps were separated from the leaf 
discs. Shoots were cleanly cut from the callus when they were large enough to distinguish from 
stems. The shoots were placed on hormone-free rooting media (MSO: 4.3 g/1 MS salts, 30 g/1 
sucrose, and 2 ml/1 500X B5 vitamins) with selection. Roots formed in 1-2 weeks. Any leaf 
callus assays are preferably done on. rooted shoots while still sterile. Rooted "shoots were placed 
in soil and were maintained in a high humidity environment (ie: plastic containers or bags). The 
shoots were hardened off by gradually exposing them to ambient humidity conditions. 

Three tobacco transformation events, designated as lines 33476, 36779, and 37235 were 
selected for further analysis. pMON17226 (Barry et aL US Patent No. 5,463,175, 1995) was 
used to produce plant line 33476 which contains an FMV-CTP-GOX gene construct. Lines 
36779 and 37235 were produced using pMON 17261, which is a plasmid derived from 
pMON 17226 which contains Notl cassette containing an FMV-CTP-PhnO gene sequence (SEQ 
ID NO: 11) in addition to FMV-CTP-GOX. The Notl cassette was constructed as follows. The 
sequence encoding CTP, represented by SEQ ID NO: 9, was excised from pMON 17058 as a 
flg/II to Ncol fragment and inserted into pMON 15028, forming a sequence represented by SEQ 
ID NO: 11 in which the CTP coding sequence was upstream of and in frame with the PhnO 
coding sequence represented within SEQ ID NO:7. The resulting construct was designated as 
pMON 15029. The CTP-PhnO coding sequence wais excised from pMON 15029 on a BgUl to 
Sort fragment and combined with pMON17063 fragments to produce pMON15038. 
pMON 17063 was disassembled using restriction digestion to provide parts necessary for 
pMON15038 construction. pMON 17063 was digested with Sacl and Hindlll to produce a vector 
backbone into which a promoter fragment and the CTP-PhnO sequence were inserted. 
pMON 17063 was also digested in a separate reaction with Hindlll and BgUl to produce a 
fragment containing an FMV promoter sequence. The promoter fragment and the CTP-PhnO 
fragment were ligated together in a reaction along with the vector backbone fragment to produce 
pMON15038, containing a Notl cassette harboring a sequence encoding a chloroplast targeted 
PhnO peptide expressed from an FMV promoter and flanked downstream by a NOS E9 3' 
transcription termination and polyadenylation sequence. This Notl sequence was excised from 
pMON15038 and inserted into the unique Notl site in pMON17241 to produce pMON17261, 
containing a chloroplast targeted GOX coding sequence expressed from an FMV promoter and 
flanked downstream by an E9 3' sequence, along with the CTP-PhnO coding sequence and 
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expression cassette. Transformation events derived from this vector are expected not only to be 
resistant to glyphosate, but to provide resistance to AMPA phototoxicity as well. Lines 36779 
and 37235 derived from pMON 17261 were analyzed for the presence of genes encoding 
glyphosate oxidoreductase and AMPA acyltransferase by PCR, for the presence of GOX and 
PhnO enzymes by western blot, and. for the presence of metabolites produced as a result of^GOX 
' mediated [ l4 C]-glyphosate degradation by HPLC. 

Line 33476, obtained as a transformation event derived from pMON 17226, was selected 
as a "GOX only" control. Lines 36779 and 37235 demonstrated different phenotypes upon 
exposure to glyphosate and were selected as glyphosate resistant events arising after 
transformation with pMON 17261. Line 37235 became bleached or yellowed upon exposure to 
glyphosate, similar in phenotype to the GOX only line 33476. However, line 36779 displayed 
no such bleaching effect. DNA was extracted from leaf tissue for each of these events as well as 
from wt Samsun tobacco leaf, and subjected to PCR to determine the presence or absence of the 
transforming phnO gene. 

Genomic DNA isolated from transformed tobacco lines was used as the template DNA in 
a PCR reaction and reaction products were compared to wild type Samsum tobacco. PCR 
reactions consisted of 50 microliters total volume containing 10X amplification buffer, 1.5 mM 
MgCl 2 , deoxynucleotide mix with each at 1 mM, 50-100 ng genomic DNA, primers each at a 
final concentration of 16.8 pM, and 1.5 units of AmpliTaq DNA polymerase (Cetus/Perkin 
Elmer). Primers (synthesized to order by GENOSYS) consisted of the sequences as set forth in 
SEQ ID NO:21 and SEQ ID NO:22. SEQ ID NO:21 is a 20 base pair sequence capable of 
priming the synthesis of the P2A phnO gene sequence (SEQ ID NO:7) and hybridizes to the first 
twenty nucleotides of the coding sequence in that gene." SEQ ID NO:22 is also a 20 base pair 
sequence, but is capable of priming synthesis of a phnO gene from the terminal coding sequence 
into the structural coding region and hybridizes to the terminal twenty nucleotides of the 
sequence encoding PhnO. Amplification conditions consisted of three cycles of 97°C for one 
minute, 60°C for two minutes, and 72°C for two minutes, followed by 37 cycles of 94°C for one 
minute, 60°C for two minutes, and 72°C for two minutes, followed generally by a 4°C soak. 10 
microliter samples were generally analyzed by 1% TAE agarose gel electrophoresis to resolve 
the relevant bands from residual primers. Upon ethidium bromide staining of the product gels, a 
phnO gene amplification product about 432 base pairs as judged by the migration position versus 
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Hindlll digested lambda molecular weight markers appeared only in the line 33779 extracts, 
indicating the presence of the phnO gene in that line. 

Seed from Ro transformation events were obtained after self crossing in growth chamber 
conditions. Ro seed were cured and planted to generate Rl progeny. Source leaves of Rl 
progeny at the five leaf stage were exposed to [ 14 C]-glyphosate by spotting a 2 microliter sample 
onto each vein(50 microliters of [ l4 C]-glyphosate Na+ salt, 517,000 dpm/microgram, 0.42 
microgram/microliter mixed with 10 microliters of glycerol). Each leaf received several spots 
depending on the number of veins on that leaf. Three days later 15 additional 2 microliter spots 
were applied to each leaf. Two weeks later, five 2 microliter spots were applied to each of two 
leaves on each plant. These were new leaves and were not the older leaves to which glyphosate 
was initially applied. Five days after this last application, about 300 milligrams of tissue was 
sampled from two sink leaves on each plant. The samples from each plant were homogenized in 
separate 1ml volumes of deionized water, centriftiged at 9,000 RPM in a microcentrifuge, and 
the aqueous volumes were collected and stored on ice. Extracts were analyzed by HPLC for the 
presence of [ 14 C] labeled metabolites as in Example 2. The extract obtained from line 33476 
(GOX) contained only [ l4 C]-AMPA. The extract obtained from line 37235 contained non- 
metabolized [ !4 C]-glyphosate as well as a trace but measurable amount of [ 14 C]-AMPA. Only 
N-acetyl-[ ,4 C]-AMPA was observed in the extract obtained from line 36779. These results are 
consistent with the PCR data which indicated that line 36773 contained at least one copy of the 
phnO gene. In addition, the lack of a bleaching effect in line 36779 after exposure to glyphosate 
is consistent with the presence of functional GOX and PhnO enzymes and the absence of 
detectable [ ,4 C]-AMPA. 

Cotton 

A recombinant phnO gene was transformed into Coker 312 variety cotton (Gossypium 
hirsutum L.). Glyphosate tolerant cotton lines were produced by Agrobacterium mediated plant 
transformation using double border binary plasmid vectors containing either (1) gox, an 
Achromobacter sp. strain LBAA gene encoding a glyphosate-metabolizing enzyme glyphosate 
oxidoreductase (GOX), (2) the gox gene and an E. coli phnO gene encoding PhnO, or (3) the 
gox/phnO double gene construct along with an Agrobacterium strain CP4 gene encoding 5- 
enolpyruvylshikimate-3-phosphate synthase (EPSPS). All vectors are crpable of replication in 
both Agrobacterium tumefaciens and E. coli hosts, and contain an aminoglycoside 
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adenylyltransferase gene (aad) conferring resistance to aminoglycosides such as spectinomycin 
or streptomycin and providing a method for plasmid maintenance. 

pMON 17241 contains a recombinant gene consisting of a 35S FMV promoter linked 5' to 
an Arabidopsis thaliana ribulose-L5- bisphosphate carboxylase small subunit (SSU1A) gene 
sequence encoding a plastid or chloroplast targeting peptide (Timko et al., 1988) which is 
• translationally fused to a gox gene coding sequence, which is linked 3' to a 3' untranslated region, 
designated E9, from a pea ribulose-1.5-bisphosphate carboxylase gene. 

pMON17213 is a double gene plant transformation vector containing expression cassettes 
comprising (1) a 35S FMV promoter linked to a sequence encoding an Arabidopsis thaliana 
EPSPS chloroplast targeting peptide linked in-frame to a strain CP4 EPSPS coding sequence, 
which is linked 3' to an E9 3* untranslated region; and (2) a 35S FMV promoter linked to an 
SSU1A gene sequence encoding a plastid targeting peptide linked in-frame to a GOX coding 
sequence, which is linked 3* to a NOS 3' termination sequence. 

pMON 17261, described above, is a double gene plant transformation vector containing 
expression cassettes comprising (1) an FMV 35S promoter linked to an SSU1A chloroplast 
targeting peptide coding sequence linked in-frame to a GOX coding sequence, which is flanked 
downstream by the E9 3' untranslated region; and (2) an FMV 35S promoter linked to an SSU1A 
chloroplast targeting peptide coding sequence (SEQ ID NO:9) linked in- frame to a PhnO coding 
sequence (SEQ ID NO: 7), which is linked 3' to a NOS 3' sequence. 

pMON10151 is a double gene plant transformation vector containing expression cassettes 
comprising (1) an FMV 35S promoter linked to an SSU1A chloroplast targeting peptide coding 
sequence (SEQ ID NO:9) linked in-frame to a PhnO coding sequence (SEQ ID NO:7), which is 
flanked downstream by a NOS 3' sequence; and (2) an enhanced 35S promoter linked to an 
SSU1A chloroplast targeting peptide coding sequence linked in-frame to a GOX coding 
sequence which is flanked downstream by a NOS 3' sequence. 

pMON10149 is a triple gene plant transformation vector containing expression cassettes 
comprising (1) an FMV 35S promoter and a petunia HSP70 5' untranslated leader sequence 
linked to an SSU1A chloroplast targeting peptide coding sequence linked in-frame to an EPSPS 
coding sequence, which is flanked downstream by the E9 3' termination and polyadenylation 
sequence; (2) an FMV 35S promoter linked to an SSU1A chloroplast targeting peptide coding 
sequence (SEQ ID NO:9) linked in-frame to a PhnO coding sequence (SEQ ID NO:7), which is 
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flanked downstream by a NOS 3' sequence; and (3) an enhanced 35S CaMV promoter linked to 
an SSU1A chioroplast targeting peptide coding sequence linked in-frame to a GOX coding 
sequence, which is flanked downstream by a nopaline synthase 3' polyadenylation sequence 
(NOS 3'). 

Plasmid vectors were assembled in E. coli K12 strains and mated into a disarmed A^I 
Agrobacterium strain. Aminoglycoside resistant Agrobacterium strains were used to transform 
Coker 312 derived hypocotyl sections with modifications as described by Umbeck et al. (1987) 
and Umbeck (US Patent No. 5,159,135 (1992), incorporated herein by reference), except that 
plants were regenerated with modifications described by Trolinder and Goodin (1987). Selection 
for glyphosate resistance produced several lines of cotton callus, which were subsequently 
determined by PCR of genomic DNA to contain the respective genes encoding EPSPS, GOX or 
PhnO transferred from Agrobacterium. Additionally, these same callus lines were determined by 
Western blot analysis to express the desired genes. After plant regeneration, whole cotton plants 
which contained the indicated coding sequences were recovered. 

Previously identified plants transformed with a double gene glyphosate resistance 
cassette comprised of EPSPS and GOX encoding genes were determined to be resistant to 
glyphosate when applied at 48 ounces per acre through the 6-7 leaf stage, however severe 
bleaching of the leaves was observed. This phytotoxic effect was presumed to be due to the 
formation of AMPA as a result of GOX mediated glyphosate 'degradation. To test this, AMPA 
was sprayed at three different rates onto wild type Coker 312 plants. Leaf chlorosis and stunted 
growth was observed in plants at four days post-application of glyphosate at 640 ounces per acre 
and at eight days post-application of 64 ounces per acre. These results suggested that the 
phytotoxic effect observed in EPSPS/GOX transformed cotton plant lines was a result of GOX 
mediated AMPA production in plants, and that the phytotoxic effect may be obviated by co- 
expression of an AMPA acyltransferase along with GOX. To test this, cotton plants expressing 
GOX or GOX plus EPSPS alone or in combination with PhnO expression were treated with 
[ ,4 C]-glyphosate, and the metabolism of the isotope labeled glyphosate was monitored in leaf 

tissue seven days after application. 

Coker 312 glyphosate resistant recombinant cotton line 4416 was selected as a glyphosate 
resistant cotton line after transformation with pMON10149, a triple gene Agrobacterium 
tumefaciens mediated double border plant transformation vector containing chioroplast targeted 
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EPSPS, GOX, and PhnO, each expressed independently from separate 35S promoters. Several 
4416 R3 plants were raised from R2 seed. One leaf of each plantlet at the three or four stage was 
treated with a mixture of ROUNDUP ULTRA™ commercial herbicide mixture (Lot No. GLP- 
9701-7428-F) which had been fortified with [ ,4 C]-glypho.sate (Code No. C-2251). The 
ROUNDUP ULTRA™ was shown to be 30.25% glyphosate acid by weight and the f* 4 C]- 
• glyphosate had a radiochemical purity of 97.3% and a specific activity of 36.36 mCi/mmol. The 
treatment solution consisted of approximately 38 \iL containing 1.60 x 10 6 dpm with a [ ,4 C]- 
glyphosate specific activity of 1.713 x 10 3 dprn/^g glyphosate acid. Three or seven days after 
topical application the treated leaves were rinsed with water, frozen in liquid nitrogen, fractured 
with a spatula and then ground using a TEKMAR™ tissuemizer in 10 mL of water. The leaf 
extracts were adjusted to pH 3.5 - 4.0 with IN HC1 and approximately 4-8000 dpm were 
analyzed for the presence of [ l4 C]-metabolites by HPLC with liquid scintillation vial collection 
and detection (HPLC/LSC) as described in example 2. The new growth including the meristem 
and new leaves that emerged following topical application were also extracted and analyzed for 
[ l4 C]-metabolites. The results are shown in Table 13. 
Table 13. [ 14 C]-Glyphosate Metabolism In Glyphosate Resistant Cotton 



Line 
4416 
Plant# 


% [ 14 C] m 
Treate 
Glyphosate 


stabolite i 
:d Leaf Ex 
AMPA 


n Glyphosate 
.tract...* 
N-Acetyl- 
AMPA 


%r 

New C 
Glyphosat 
e 


l C] metal 
irowth E 
AMP 
A 


jolite in 
xtract...* 
N-Acetyl- 
AMPA 


MD03 


55.2 


2.5 


37.4 


nd** 


nd 


93.4 


MD04 


94.6 


2.1 


1.7 


97.9 


nd 


nd 


AOl 


48.6 


2.1 


44.7 


0.9 


0.2 


95.8 


A02 


67.3 


2.0 


29.1 


.. 0.7 


0.2 


96.5 


A03 


48.8 


2.0 


43.4 


1.2 


nd 


94.0 


A04 


19.4 


1.6 


73.9 


1.5 


nd 


94.0 


AOS 


59.9 


2.2 


31.1 


2.2 


0.2 


95.2 


A06 


38.2 


nd 


60.9 


1.5 


0.2 


93.5 


A07 


I 64.1 


nd 


26.8 


1.4 


0.5 


93.9 


A08 


90.9 

1 — : ,m-.-. — n 


2.0 


1.9 


91.2 


2.5 


1.9 



isotope observed by HPLC/LSC in each sample. 

** nd indicates that the metabolite was not detected by HPLC/LSC 



Analysis of the water rinsed glyphosate treated leaves indicated the presence of 
significant levels of N-acetyl-[ l4 C]-AMPA in eight of the ten plants tested. These levels 
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10 



15 



represented 27-74% of the isotope extracted from the treated leaves The remaining activity was 
almost entirely [ l4 C]-glyphosate. Very little of the [ ,4 C] isotope was present as [ ,4 C]-AMPA. 
The remaining two plants had very limited ability to metabolize glyphosate as indicated by the 
high levels of [ l4 C]-glyphosate remaining on or in the leaves. One of these plants also showed 
signs of stunting seven days after treatment, indicating glyphosate phytotoxicity. f 

Analysis of new growth in the ten plants tested showed that the predominant form of 
[ l4 C] labeled metabolite present was N-acetyI-[ l4 C]-AMPA at greater than 90% of the total 
radioisotope in the samples. In contrast, more than 90% of the isotope in the remaining two 
plants was in the form of [ ,4 C]-glyphosate, consistent with the analysis of the extract from the 
treatment leaf for these two plants. 

The metabolism of [ 14 C]-glyphosate in recombinant cotton lines 4268 (GOX/PhnO) and 
3753 (EPSPS/GOX) was also studied. Plants in this study were treated as indicated above for 
cotton line 4416 ? by applying droplets of ROUNDUP ULTRA fortified with [ ,4 C]-glyphosate to 
a single leaf on each plant at the three to four leaf stage. Treated leaves were harvested and 
rinsed with water, then ground and extracted, and extracts were analyzed by HPLC as described 
above for the presence of [ ,4 C]-glyphosate, [ ,4 C]-AMPA, and N-acetyl-[ 14 C]-AMPA. New 
growth, including the meristem and new leaves that emerged following application were also 
extracted and analyzed. The results are shown in Tahle 14. 
Table 14. [ I4 C]-GIyphosate Metabolism In Glyphosate Resistant Cotton 



Plant 


*% ['"C] metabolite in Glyphosate Treated 

Leaf Extract... 
Glyphosate | AMPA | N-Acetyl-AMPA 


*% [ ,4 C] metabolite in 
New Growth Extract. . . 
Glyphosate | AMPA | N-Acetyl-AMPA 


GOX/P 


hnO Plants 


B01 


76.7 


L 3 -° 


14.0 


3.4 


1.0 


89.9 


B02 


63.9 


4.8 


25.0 


1.1 


1.5 


91.5 


B03 


54.4 


3.2 


36.4 


0.8 


nd 


94.7 


B04 


58.3 


5.7 


28.9 


1.1 


1.2 


91.0 


EPSPS/GOX Plants 


C01 


59.8 


26.6 


nd 


3.72 


85.7 


nd 


C02 


92.7 


2.1 


0.8 


92.8 


0.8 


nd 


C03 


81.2 


10.7 


nd 


13.5 


72.0 


1.9 


C04 


| 86.2 


6.4 


1.0 


13.9 


76.2 


nd 


~H"C] 
isotope 
**ndi 


-Glyphosate, 
labeled metat 
ndicates that tl 


l4 C]-AM 
>olites obs 
le metabo 


PA, and N-Acetyl-["*C]-AMPA as a percentage ot total |"<JJ 
served after HPLC/LSC analysis in each sample, 
lite was not detected by HPLC/LSC. 
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Significant levels of N-acetyl-[ ,4 C]-AMPA were present in the treated leaves of all four 
line 4268 plants (GOX/PhnO; B01-B04). In contrast, N-acetyl-[ l4 C]-AMPA was not detectable 
in extracts obtained from line 3753 plants (EPSPS/GOX; C01-C04). Three of these plants 
contained significant levels of [ l4 C]-AMPA in treated leaf extracts, ranging from 6-27%. One 
line 3753 plant was deficient in the conversion of [ ,4 C]-glyphosate to N-acetyl-[ l4 C]-AMP/f, and 
. this plant also appeared to be stunted. 

90-95% of the [ )4 C] isotope in extracts of new growth from line 4268 plants was 
determined to be in the form of N-acetyl-[ l4 C]-AMPA. However, 72-86% of the [ ,4 C] isotope in 
extracts of new growth from three of the line 3753 plants was determined to be [ 14 C]-AMPA, 
with [ I4 C]-glyphosate accounting for the remainder of the isotope in these tissues. 93% of the 
isotope obtained from line 3753 plant number C02 was determined to be [ 14 C]-glyphosate, 
consistent with the lack of glyphosate metabolism in the application leaf as well as the observed 
stunting. In addition, growth regions of all line 3753 plants were discolored and yellow 
following treatment, but improved with time. By harvest, new growth leaves became mottled. 

These results are consistent with the presence of active gox and phnO gene products in 
the indicated plants. The GOX and PhnO proteins are metabolizing glyphosate to AMPA and N- 
acetyl-AMPA in the predicted manner, and line 4268 plant extracts provide a similar metabolic 
pattern to that observed with line 4416 plant extracts as judged by HPLC and by phenotypic 
observation. In both lines, the predominant [ ,4 C] product in new growth tissue extracts after 
[ 14 C]-glyphosate application is N-acetyl-[ 14 C]-AMPA. The phytotoxicity as observed by 
discoloration of plant leaves in line 3753 after glyphosate application is associated with the lack 
of an AMPA N-acyltransferase activity. In contrast, the presence of an AMPA N- 
acyltransferase activity in both the 4416 and the 4268 plant lines resulted in a lack of phytotoxic 
effects observed in line 3753 plants. 

Canola 

Canola plants were transformed with the vectors pMON17138 and pMON17261 and a 
number of plant lines of the transformed canola were obtained which exhibited glyphosate 
tolerance. Plants were transformed according to the method described in Barry et al. (US Pat 
No. 5,633,435). Briefly, Brassica napus cv Westar plants were grown in controlled growth 
chamber conditions as described. Four terminal intemodes from plants just prior to bolting or 
plants in the process of bolting but before flowering were removed and surface sterilized in 70% 



WO 00/29596 



PCT/US99/27152 



-98- 

v/v ethanol for one minute, then in 2% w/v sodium hypochlorite for twenty minutes, then rinsed 
three times with sterile distilled deionized water. Stems with leaves attached could be 
refrigerated in moist plastic bags for up to three days prior to sterilization. Six to seven stem 
segments were cut into 5 mm discs with a Redco Vegetable Slicer 200 maintaining orientation of 
s basal end. Stem discs (explants) were inoculated with 1 milliliter of Aftl Agrobacter p ium 
tumefaciens strain A208 containing a recombinant plant transformation plasmid prepared as 
described above. Explants were placed basal side down in petri plates containing 0. 1 X standard 
MS salts, B5 vitamins, 3% sucrose, 0.8% agar, pH 5.7, 1 mg/1 BA (6-benzyladenine). The plates 
were layered with 1.5 ml of media containing MS salts, B5 vitamins, 3% sucrose, pH 5.7, 4 mg/1 

io p-chlorophenoxyacetic acid, 0.005 mg/1 kinetin and covered with sterile filter paper. 

Following a 2.3 day co-culture, explants were transferred to deep dish petri plates (seven 
explants per plate) containing MS salts, B5 vitamins, 3% sucrose, 0.8% agar, pH 5.7, 1 mg/1 BA, 
500 mg/1 carbenicillin, 50 mg/1 cefotaxime, 200 mg/1 kanamycin or 1 75 mg/1 gentamicin for 
selection, and transferred after three weeks to fresh media, five explants per plate. Explants were 

is cultured in a growth room at 25°C with continuous light (Cool White). After an additional three 
weeks, shoots were excised from the explants, and leaf recallusing assays were initiated to 
confirm modification of Ro shoots. Three tiny pieces of leaf tissue were placed on recallusing 
media containing MS salts, B5 vitamins, 3% sucrose,' 0.?% agar, pH 5.7, 5 mg/1 BA, 0.5 mg/1 
naphthalene acetic acid (NAA), 500 mg/1 carbenicillin, 50 m^/1 cefotaxime, 200 mg/1 kanamycin 

20 or gentamicin or 0.5 mM glyphosate. The leaf assays were incubated in a growth room under the 
same conditions as explant culture. After an additional three weeks, the leaf recallusing assays 
were scored for herbicide tolerance (callus or green leaf tissue) or sensitivity (bleaching). 

Each shoot stem was dipped in ROOTONE at the time of excision, placed in a two inch 
pot containing Metro-MIX 350, and maintained in a closed humid environment in a growth 

25 chamber at 24°C, 16/8 hour photoperiod, 400 uE per square meter per second (HID lamps) for a 
hardening-off period of approximately three weeks. 

Plasmid pMON17138 is an Agrobacterium mediated single border plant transformation 
vector maintained in the bacterium by selection on streptomycin or spectinomycin. pMON17138 
contains a single right Ti border flanking the 3' end of the genetic elements desired to be 

30 transferred into the plant genome. This vector contains two plant operable expression cassettes. 
One cassette is comprised of a caulimovirus 35S promoter driving expression of a neomycin 
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phosphotransferase gene {npt\l\ flanked downstream by a nopaline synthase 3' transcription 
termination and polyadenylation sequence (NOS 3'). The other cassette is comprised of a 
figwort mosaic virus promoter (described in Rogers, US Pat No. 5,678,319) upstream of a pea 
ribulose bisphosphate carboxylase small subunit transcription termination and polyadenylation 
sequence. A chloroplast targeted glyphosate oxidoreductase (GOX) coding sequence is incited 
♦ between the promoter and pea 3' sequence. 

Plasmid pMON 17261 is an Agrobacterium mediated double border plant transformation 
vector similar to pMON17138. A chloroplast targeted GOX encoding cassette identical to that in 
pMON17138 is present downstream from a Ti right border, and upstream of an additional plant 
operable expression cassette comprised of a figwort mosaic virus promoter (P-FMV) linked to a 
NOS 3' sequence. A chloroplast targeted PhnO coding sequence is inserted between the second " 
P-FMV and NOS3' sequences. 

Ri plants derived from transformation events using pMON17261 and pMON17138 were 
evaluated using a glyphosate spray test described in Barry et al. (US Pat No. 5,633,435). 

Corn 

An AMPA acyltransferase gene has also been introduced into Black Mexican Sweet corn 
cells with expression of the gene and glyphosate resistance detected in callus. Callus tissue was 
transformed according to the method described in Barry et al. (US Pat. No. 5,463,175). Various 
plasmids were used to introduce glyphosate resistance genes encoding GOX and EPSPS in 
combination with an AMPA acyltransferase gene into corn cells. These plasmids differed from 
each other with respect to promoters used, chloroplast or plastid targeting peptide sequences 
used, untranslated leader sequences used, presence or absence of an intron, and type of 3 f 
terminator used, however all plasmids contained a synthetically derived AMPA acyltransferase 
gene encoding PhnO containing the P2A mutation. The synthetic gene was constructed from 
three smaller polynucleotide sequences synthesized for Monsanto and characterized for the 
presence of the desired DNA coding sequence and amino acid sequence translation by 
Stratagene, Inc., La Jolla, CA. The non-naturally occurring gene was assembled from three 
smaller sequences comprised of SEQ ID NO: 16, SEQ ID NO: 17, and SEQ ID NO: 18, wherein 
the fully assembled gene is represented by SEQ ID NO: 19, and is present in each of the plasmids 
used for the corn callus transformation. The non-naturally occurring gene coding sequence was 
established based on the method described in Fishhoff et al. in US Patent No. 5,500,365 in which 
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monocot preferred codons were used in place of those preferred by E. coli. The fully assembled 
gene encodes a full length PhnO protein identical to the native protein sequence with the 
exception of the P2A mutation introduced by PCR using SEQ ID NO:5 and SEQ ID NO:6 to 
engineer appropriate restriction endoruolease recognition sites into the flanking ends of the 
coding sequence. Plasmids which were used in generating the corn callus data are shown in 
Table 15 along with differences with respect to genetic elements flanking the AMPA 
acyl transferase encoding sequence. 

Table 15. Corn Callus Transformation Plasmids and Relevant Genetic Elements 



Plasmid 



Relevant Genetic Elements* 
[Pe35S / l-Zm.Hsp70/ CTP / phnO / T-At.Nos]-> GOX EPSPS 



pMON32926 -, - , 

pMON3293 1 [Pe35S / I-Zm.Hsp70 / phnO / T-At.Nos] -> GOX -» EPSPS 
pMON32932 [Pe35S / I-Zm.Hsp70 / CTP / phnO / T-At.Nos] ■> GOX -> EPSPS 
pMON32936 [P-Os.Actl / I-Os.Actl / CTP / phnO / T-At.Nos]^ GOX -> EPSPS 
pMON32938 [P-Os.Actl / I-Os.Actl / CTP / phnO / T-At.Nos] GOX -> EPSPS 
pMON32946 [Pe35S / L-Ta.Cab / CTP / phnO / T-Ta.Hsp70] ]-> GOX -» EPSPS 
pMON32947 [Pe35S / L-Ta.Hsp70 / CTP / phnO / T-Ta.Hsp70] -» GOX -> EPSPS 
pMON32948 EPSPS-* [Pe35S / I-Zm.Hsp70 / CTP / phnO / T-At.Nos] -» GOX 
pMON32950 EPSPS-»[Pe35S / I-Zm.Hsp70 / CTP / phnO / T-At.Nos]-» GOX 
pMON32570 EPSPS -»[Pe35S / L-Ta.Cab / I-Os.Actl / CTP / phnO / T-Ta.Hsp70]-»GOX 
pMON3257 1 EPSPS -»[Pe35S / L-Ta.Cab / I-Os.Actl / CTP / phnO / T-Ta.Hsp70] ->GOX 
pMON32572 EPSPS -»[Pe35S / L-Zm.Hsp70 / I-Os.Actl /CTP / phnO / T-Ta.Hsp70]-»GOX 
pMON32573 | EPSPS ->[Pe3SS / L-Ta.Cab / I-Os.Actl /CTP / phnO / T-Ta.Hsp70]-»GOX 
* Genetic elements contained within PhnO expression cassettes as indicated in each plasmid. 
Elements are shown in the order in which they appear in the plasmid, along with the presence of 
other genes encoding herbicide resistance, if present, flanking' the PhnO expression cassette. 
-> indicates the direction of transcription of each gene or genes flanking the PhnO expression 
cassette. Individual elements are described in the text. 

Promoters which were used included the CaMV e35 S promoter and the rice actin 
promoter (P-Os.Actl). Introns which were used included those obtained from plant genes such 
as corn Hsp70 (I-Zm.Hsp70) and rice actin (I-Os.Actl). Non-translated leader sequences which 
were used included wheat chlorophyll a/b binding protein (L-Ta.Cab) and corn Hsp70 (L- 
Zm.Hsp70). Termination and polyadenylation sequences which were used included 
Agrobacterium tumefaciens NOS 3* (T-At.Nos) and wheat Hsp70 (T-Ta.Hsp70). The same 
chloroplast targeting sequence was used in all PhnO expression cassettes, represented by SEQ ID 
NO: 9. 

A [ l4 C]-glyphosate metabolism assay was used for determining whether transformed 
corn callus tissues contain functioning forms of these enzymes. The assay was developed to 
screen large numbers of corn callus samples. Callus was obtained from Monsanto Company and 
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Dekalb Seed Company corn transformation groups. The Monsanto callus samples, individually 
designated as callus lines " 1 9nn-nn-nn : ' in Table 16, were produced from HI II X B73 corn 
embryos. Callus samples were bombarded with complete covalently closed circular recombinant 
plant transformation vector plasmid DNA or with linear DNA fragments isolated from such 
plasmids 25-50 days after embryo isolation. Transformed lines were identifies 8-14 week^ after 
bombardment. These lines were sub-cultured on fresh media every 2 weeks and were 5-7 
months old when used in the metabolism assay. The Dekalb callus lines 00, OR, OW, OX, and 
OY were obtained from HI II x AW embryos. All line designations correspond to the 
recombinant plasmid or linear fragment used for ballistic transformation of callus tissue as noted 

in the legend to Table 16. 

4.5 mCi of N-phosphono-[ ,4 C]-methylglycine ([ l4 C]-glyphosate) was obtained from the 
Monsanto Radiosynthesis group in a 1.5 mM aqueous solution, having a specific radioactivity of 
39.4 mCi/mM (5.2 X 10 5 dpm/microgram). The sample was identified with code number C- 
2182.2. A stock solution sterilized by filtration through a 0.2-micron Acrodisk (Gelman no. 
4192) was prepared by combining 2.5 mL [ I4 C]-glyphosate (3.3 X 10 8 dpm) with 2.5 mL of 
corn callus growth medium (N6 medium) and 5.0 mg of Mon 0818 surfactant. [ ,4 C]-glyphosate 
in the resulting dose solution was 0.75 mM. The N6 medium was described by Chu et al. 
(1975) and was prepared using salts and vitamins obtairied from Sigma Chemical Company, St. 
Louis, MO. Mon 0818 surfactant is ethoxylated tallowamine, the surfactant used in Roundup 
herbicide. The dose solution was subjected to HPLC analysis as described in Example 2. The 
results are shown in a chromatogram illustrated in Figure 1. Three radioactive peaks were 
resolved, the largest of which corresponded to glyphosate (1 1.3 min, 98.8%). Impurity peaks 
corresponding to [ l4 C]-AMPA (5.8 min, 0.16%) and an unidentified material (10.2 min, 1.0%) 
were also present in the dose solution. No peaks corresponding to N-acetyl-[ l4 C]-AMPA were 
present in the dose solution. Two additional dose solutions were prepared using these reagents, 
each of which were scaled three fold to 15 ml volumes based on the preparation method 
described above. 

N-acetyI-[ I4 C]-AMPA was synthesized for use as a retention time HPLC standard. 1 mL 
of pyridine and 2 mL of acetic anhydride was added to a 20-mL screw cap culture tube and 
chilled on ice. 0.1 mL of an aqueous solution of [ l4 C]-AMPA (6.2 X 10 6 dpm, code C-2127.2) 
was added to the chilled solution. The tube was then removed from the ice bath and warmed to 
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about 50-60°C. A 10-^iL sample was removed after about 30 minutes and combined with 0.5 
mL of water and analyzed according to the HPLC method set forth above. [ ,4 C]-AMPA was not 
detected, however two new radioactive peaks were identified; one peak at 13.9 minutes (68%) 
and the other at 15.4 minutes (32%). A sample of the material eluting at 13.9 minutes was 
isolated and analyzed by negative ion electrospray mass spectrometry. The resuh showed strtfng 
ions at m/e 152 and 154, as expected for this compound, which has a molecular weight of 153 
Daltons; the m/z 154 ion was due to the isotopic [ ,4 C ]atom. The radioactive peak eluting at 15.4 
minutes was not isolated. However, in a separate HPLC experiment, it was shown to co-elute 
with synthetic N-acetyl-N-methyl-AMPA. N-methyl-[ 14 C]-AMPA has previously been shown to 
be an impurity in the initial [ 14 C]-AMPA material. 

Under aseptic conditions, corn callus samples were transferred to individual wells of 
sterile 48-well COSTAR™ ceil culture clusters (cat. No. 3548). The individual callus samples 
were not weighed. However, in several cases the total weight of the callus samples in a 48-well 
plate was determined. Typically, the average weight of individual callus samples was 
approximately 200-250 mg. In each assay, a nontransformed callus sample, HI II X B73, was 
included as a control. 50 of dose solution containing 3.3 X 10 6 dpm of [ 14 C]-glyphosate was 
added to each callus sample. 48-well plates were sealed with parafilm and placed in a plastic bag 
containing a wet paper towel to provide a moist atmosphere. Bags were closed and placed in a 
dark drawer at 25°C for 10 days. Each callus sample was subsequently transferred to a labeled 
microcentrifuge tube (VWR, 1.7-mL, cat. No. 20170-620). 1.0 mL of de-ionized water was 
added to each tube, and the tubes were closed and placed in round 20-tube floating 
microcentrifuge racks (Nalge cat. no. 5974-1015). These microfuge tubes were floated in 
boiling water for 30 minutes, shaken using a vortex mixer, and centrifuged for 5 minutes using a 
Fisher brand microcentrifuge. 120-|iL supernatant samples were removed for analysis by HPLC 
as described below. The samples were injected using a Waters WISP autoinjector. 
Chromatographic profiles were obtained for each sample analyzed, and quantitative information 
was obtained by extrapolating the area under the radioactive elution peaks to total [ C] in each 
sample. Figure 2 shows an HPLC profile of a mixture of standards of the observed radioactive 
metabolites [ ,4 C] AMPA, [ l4 C] glyphosate, and N-acetyl-[ 14 C]-AMPA and the impurity 
identified as N-acetyl-N-methyl-[ ,4 C]-AMPA. 
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HPLC analysis was typically completed using a SPHER1SORB™ S5 SAX 250 mm x 
10mm column for most analyses. Some samples were analyzed on an ALLTECH™ 5-micron, 
250 x 10 mm SAX column, which provided similar performance. Two solvents were prepared. 
Solvent A consisted of 0.005 M KH 2 P0 4 , adjusted to pH 2.0 with H 3 P0 4 and contained 4% 
methanol. Solvent B consisted of 0.10 M KH 2 P0 4 , adjusted to pH 2.0 with H 3 P0 4 and also 
■ contained 4% methanol. The eluent flow rate was set at 3 mL/min, and the scintillation fluid 
flow rate was set at 9 mL/min using ATOMFLOW™ scintillation fluid (No. NEN-995, from 
Packard Instruments). All column solvent steps were linear, with the injection and column 
solvent flow rates as indicated in example 2. The column is prepared for an additional injection 
at 20 minutes. 

Callus samples from 359 transformed com lines were combined with 50-uL aliquots of 
[ l4 C]-glyphosate dose solution and incubated for 10 days in the dark. Each post-incubation 
callus sample, together with its clinging dose material, was transferred to a 1.7-mL 
microcentrifuge tube along with 1 mL of water, and each tube was placed in boiling water. This 
step causes cell lysis, releasing soluble intracellular compounds including any isotope labeled 
compounds such as glyphosate, AMPA, and N-acetyl-AMPA. It was determined during method 
development that if the post-incubation calli were rinsed thoroughly with water, 85-95% of the 
radioactivity was rinsed off, and HPLC analysis showed that virtually all of the radioactivity in 
the rinses was due to [ l4 C]-glyphosate and none was attributable to [ ,4 C]-metabolites. In these 
experiments, the rinsed calli gave extracts containing [ l4 C]-metabolites in addition to [ l4 C]- 
glyphosate. This indicated that the radioactivity in the rinses was due mainly, if not exclusively, 
to unabsorbed surface [ 14 C]-glyphosate. It is important to take this into account when 
considering the rather low percentages of the dose converted to metabolites, because the 
percentage calculation includes large amounts of unabsorbed surface radioactivity. The method 
development work also showed that simply boiling the incubated calli in water released as much 
radioactivity as could be released by a conventional grinding/extracting procedure. Experiments 
were conducted to show that oiling did not alter the metabolite profiles. The streamlined 
procedures made it possible to analyze large numbers of samples (e.g., 96) at one time. Table 16 
shows representative data of the callus samples producing the highest levels of N-acetyl-[ 14 C]- 
AMPA or [ 14 C]-AMPA obtained after HPLC analysis. A representative chromatogram of a 
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GOX plus AMPA acyltransferase transformed, glyphosate treated, callus extract sample 



shown in Figure 3. 
Table 16. 

Transformed Corn Callus Lines Producing Amounts of AMPA or N-Acetyk 



Callus Producing N-Acetyl-[ ,4 C] : AMPA 


Callus 


Producing 


»AMPA , 


Callus * 


Transformed 
with... 


Percent ** 
N-Acetyl-[ 14 C]- 
AMPA 


Callus * 


Transformed 
with... 


Percent ** 
I CJ-AMrA 


1978-05-02 


pMON32570 


0.27 






2.89 


1978-08-01 


pMON32570 


0.94 




nMO"N12926 


2.00 


1978-20-02 


pMON32570 


0.57 






5.00 


1978-21-02 


pMON32570 


0.23 




nMON12926 


2.00 


1978-22-01 


pMON32570 


0.90 


r\nc in 




5.08 


1978-24-02 


pMON32570 


1.80 


1 V / 1-U5-U 1 


piVlWlN JA7 j£ 


2.64 


1978-35-01 


pMON32570 


0.22 


1971-2 /-U3 


piviupi j&y j t. 


3.63 


1980-01-01 


pMON32570 


0.27 


UUjU!) 




2.73 


1980-03-01 


pMON32571 


0.22 




pMON32932 


2.86 


1981-28-01 


pMON32571 


0.25 


005 10 


pMON32932 


2.34 


1981-02-01 


pMON32572 


0.65 


00512 


pMON32932 


2.31 


1981-03-01 


pMON32572 


L 0.74 


00514 


pMON32932 


1.98 


1981-18-01 


pMON32572 


0.22 


00535 


pMON32932 


2.88 


1981-23-01 


pMON32572 


0.48 


00538 


pMON32932 


2.70 


1981-24-02 


pMON32572 


0.29 


00539 . 


pMON32932 


1.97 


1981-32-02 


pMON32572 


1.08 


00553 


pMON32932 


3.56 


1977-05-03 


pMON32573 


0.39 


00576 


P MON32932 


3.49 


OR516 


pMON32926 


1.91 


00579 


pMON32932 


2.85 


1972-14-01 


pMON32931 


0.40 


1986-17-01 


pMON32936 


2.29 


1972-32-01 


pMON32931 


0.75 


1986-18-03 


pMON32936 


3.05 


1972-33-01 


pMON32931 


0.55 


1986-18-04 


pMON32936 


2.15 


00544 


pMON32932 


0.28 


1986-28-02 


pMON32936 


2.06 


1986-06-01 


pMON32936 


0.30 


1983-12-02 


pMON32938 


2.41 


1986-08-01 


pMON32936 


1.13 


1983-31-01 


pMON32938 


2.90 


1986-08-03 


pMON32936 


0.70 


1985-03-02 


pMON32946 


2.51 
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Table 16. (continued) 



Callus Producing N-Acetyl-( I4 C]-AMPA 



Callus Producing [ ,4 C|-AMPA 



Callus * 


Transformed 
with... 


Percent ** 
N-Acetyl-[ ,4 C]- 
AMPA 


Callus * 


Transformed 
with... 


Percent ** 
[ l4 C]-AMPA 


1986-12-01 


pMON32936 


0.33 


1985-38-01 


pMON32947 


1.99 > 


1986-18-02 


pMON32936 


0.40 


0X512 


pMON32948 


2.43 


1986-18-03 


pMON32936 


0.51 


0X533 


pMON32948 


3.91 


1986-18-04 


pMON32936 


1.09 


0X556 


pMON32948 


12.11 


1986-22-04 


pMON32936 


0.64 


OY504 


pMON32950 


2.25 


1983-11-01 


pMON32938 


0.21 


OY511 


pMON32950 


2.53 


OW534 


pMON32946 


0.77 


OY528 


pMON32950 


2.58 


OW542 


pMON32946 


0.85 


OY532 


pMON32950 


2.24 


1985-26-01 


pMON32947 


0.60 


OY534 


pMON32950 


4.02 


1985-26-03 


pMON32947 


0.71 


OY535 


pMON32950 


2.34 


1985-11-04 


pMON32952 


0.37 


OY540 


pMON32950 


5.57 



with isolated linear fragments of plasm ids. Linear fragments were isolated so as to be separate from 
piasmid backbone structure. 

** percent radioactivity detected for N-Acetyl-[ ,4 C]-AMPA or [ ,4 C]-AMPA peaks determined as a 
fraction of the total amount of radioactivity in the sample, including residual [ ,4 C]-glyphosate as 
described in the text. „ 



19 of the 359 callus samples tested produced extracts containing N-acetyl-[ ,4 C]-AMPA at 
a level distinctly higher than the other callus samples. Callus OR516 was the strongest in this 
respect and was analyzed five times during a period of two months, providing values ranging 
from 0.50-4.54% (average 1 .91%). The basis for the relatively large spread in the percentage of 
N-acetyl-[ l4 C]-AMPA formed at various times is unknown. In four of the five analyses of 
OR516, the percentage of N-acetyl-[ ,4 C]-AMPA present was higher than that of [ l4 C]-AMPA, 
indicating an efficient conversion of [ I4 C]-AMPA to N-acetyl-[ l4 C]-AMPA. The callus next 
most efficient in producing N-acetyl-[ I4 C]-AMPA was 1978-24-02, which was the only other 
callus besides OR516 that contained more N-acetyl-[ ,4 C]-AMPA than [ |4 C]-AMPA in its extract. 
One hundred of the 359 callus samples tested produced extracts containing [ 14 C]-AMPA at a 
level distinctly higher than other callus samples. 0X556 was a superlative producer of [ I4 C]- 
AMPA, yielding more than twice as much of the metabolite as any other callus in the study. The 
control callus, HI II X B73, which contained no inserted genes, produced no detectable levels of 
N-acetyl-[ ,4 C]-AMPA and only background levels of [ l4 C]-AMPA. This result indicates that 
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expression of an AMP A acyltransferase in corn is effective in conversion of AMP A produced as 
a result of GOX mediated glyphosate degradation to N-acetyl-AMPA. 
Wheat 

GOX mediated glyphosate degradation has been shown to produce AMPA, and AMPA 
has previously been shown to be the source of phytotoxic effects. Therefor, effects of wheat 
plant exposure to the compounds AMPA or N-acetyl-AMPA was determined as in example 2 in 
order to observe any wheat sensitivity or insensitivity to either of these compounds. The 
observation of any phytotoxic effects would indicate that GOX mediated glyphosate metabolism 
would be detrimental to Triticum species. 

Wheat immature embryos were exposed to different concentrations of AMPA and N- 
acetyl-AMPA in a wheat embryo germination assay. MMSO base media was prepared 
containing 40 grams per liter maltose, 2 grams per liter GELRITE™, MS salts, and vitamins. 
Salts, vitamins, and maltose were dissolved in 3500 ml water and the pH was adjusted to 5.8. 
500 ml was dispensed into a separate bottle along with 1 gram of GELRITE™ and autoclaved 
for 17 minutes. After the medium had cooled to about 45°C, AMPA or N-acetyl-AMPA was 
added to a defined concentration. The mixture was poured into six square Sundae cups under 
sterile conditions and allowed to solidify. 

Immature wheat embryos were isolated from twenty day old seedlings (after anther 
formation) and inoculated into each MMSO media. Each Sundae cup contained nine immature 
embryo's. Three separate plates were used for each concentration of AMPA (0, 0.1, 0.15, 0.2, 
0.25, 0.3, and 1.0 mM) or N-acetyl-AMPA (0, 0.1, 0.3, 1.0, and 3.0 mM). Sundae cups were 
incubated for ten days and the length of roots and shoots were determined and compared. The 
results are shown in Table 17. 
Table 17. 



Comparison of AMPA and N-acetyl AMPA on Germinating Shoot and Root Length 



Phosphonate Compound 


Shoot (cm) 


Root (cm) 


AMPA (mM) 






0.00 


12.6 ±2.6 


7.0 ±1.9 


0.10 


11.7 ±2.5 


8.0 ±2.0 


0.15 


11.3 ±2.1 


6.3 ±1.7 


0.20 


9.2 ±1.8 


4.6 ±2.1 


0.25 


8.5 ± 1.8 


3.1 ±1.6 


0.30 


6.6 ±1.8 


2.6 ±1.6 


1.00 


0.9 ±0.1 


I 0.4 ±0.1 
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N-Acetvl-AMPA 




0.00 


12.6 ±2.6 


7.0 ± 1.9 


0.10 


12.012.4 


5.9 ± 1.4 


0.30 


11.7 ±3.5 


5.2 ± 1.2 


1.00 


12.2 ±3.2 


5.4 ± 1.5 


3.00 


11.2 ±2.6 


5.9±1.6 t 



AMPA was not substantially inhibitory to growth and elongation of immature embryo's 
at concentrations under 0.2 mM. However, concentrations above 0.2 mM were severely 
inhibitory to both shoot and root elongation, indicating that AMPA may also be phytotoxic to 
wheat and. considering the nature of the monocot crop species as a whole, phytotoxic to other 
monocotyledonous crops as well as turf grasses. Germination of immature embryo's was 
significantly affected when the AMPA level was higher than 0.20 mM. 1.00 mM AMPA 
eliminated the germination of immature embryo's in wheat. In contrast, N-acetyl-AMPA was not 
inhibitory to shoots and only mildly inhibitory to root elongation at any concentrations tested in 
this experiment. The highest N-acetyl-AMPA concentration tested was greater than ten times 
the minimal non-inhibitory concentration determined for AMPA. There are no significant 
effects to immature embryo germination when the N-acetyl-AMPA concentration is less than 3.0 
mM. This result indicates that N-acetylation of, AMPA in wheat would prevent AMPA 
phytotoxicity arising as a result of GOX mediated glyphosate herbicide metabolism. 

Recombinant glyphosate tolerant wheat plants were generated according to the 
method of Zhou et al. (Plant Cell Reports 15:159 -163, 1995). Briefly, spring wheat, Triticum 
aestivum cv Bobwhite, was used as the target transformation line. Stock plants were grown in an 
environmentally controlled growth chamber with a 16 hour photoperiod at 800 microJoule per 
square meter per second provided by high-intensity discharge lights (Sylvania, GTE Products 
Corp., Manchester, NH). The day /night temperatures were 18/16°C. Immature caryopses were 
collected from the plants 14 days after anthesis. Immature embryos were dissected aseptically 
and cultured on MMS2 medium, a Murashige and Skoog (Physiol. Plant 15:473-497, 1962) basal 
medium supplemented with 40 grams per liter maltose and 2 milligrams per liter 2,4-D. In some 
experiments, CM4 medium was used. CM4 medium contains is MMS2 medium, but contains 
only 0.5 millig ams per liter 2,4-r md includes 2.2 milligrams per liter picloram. The immature 
embryos were cultured at 26°C in the dark. 
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Immarure embryos were referred Ave days after culrur- initiation to an osmonc 
tmaunen, CM4 medium contaimng 0.35 M — four hours prior . bombards, accordmg 
,o the rne.hod of Ruasel, e, * fln Virro Cell Devel. Bio,., 28P:97-,05, .992). Thtn, to forty 
embr yos were placed in rhe earner of each plateand bombarded in a DuPon. PDSI000 appals. 
PUtsmid DNA was adsorbed onro 1 pan tungsten panicles accotdtng ro me method of Sanfo,d e, 
al (Particle Sci. Technol., 5:27-37, 1987). Embryos were bombarded rwice a. a distance of 13 
' ™, from rhe stopping plate. A ,00 pm stainless steel screen was placed immediately below the 
stopping plate. 

After a 16 hour post bombardment treatment on the osmotic medium, the embryos were 
^nsferred to MMS2 or CM4 medium. Following a one week delay, .he embryos were 
transferred to the MMS2 or CM4 median, containing 2 mM glyphosate. After 9-12 weeks of 
callus proliferate on the selection medium, calli were transferred to a MMS0.2 regenerate 
nredium containing 0.2 mg/l 2,4-D and 0.1 mM glyphosate. Shoors obtained from the 
regeneration medium were referred ro MMS0 wirhou, 2,4-D but conraining 0.02 mM 

''^yphosate tolerant Ro plants as well aa R, progeny were transferred to ,5 centtmeter 
diameter pots and grown in an environmentally controlled chamber as described above. Two 
weeks later, rhe plants were sprayed with 3 ml/liter ROUNDUP (41% acttv. tngredtent, 
Monsanto Company) in a spray chamber, which was designed to mimic a field dose appltcatron 
. of 0.6 kilograms glyphosate per hectare. Damage symptoms ware observed and recorded a, 

different stages following the spraying. 

Genomic DNA was iaolared from leaf tissue of R, and R, progeny followtng rhe method 
of Shore e. al. (Cell 35:225-233, ,983). Fifteen micrograms of genomic DNA was digesred wtdr 
*fl restriction endonuclease and fraofionated on a 0.8% agarose gel. The DNA was 

u barred ,„ Hybond N membranes (Amersham, according .0 the standard procedure desenbed 
in Sambrook a, al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laborer, 
Press, 1989). The membranes were probed independently ** * P«~ ° f 
EPSPS and GOX. A 3.4 kb DNA fragment containing the EPSPS gene and a 4.8 kb DNA 
fragment conraining .be GOX gene warn released from P MON,9574 by «n " 

30 endonuclea* digestion, isolated by 0.7,. agarose gal elechophoresh and label* - I 
dCTP using a Straragene PRIME-IT 1. random primer labeling kit Probes wet* labeled ro 
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specific activity of 3 X 10 9 counts per minute per microgram and 1.3 X 10 9 counts per minute 
per microgram, respectively. Membranes were hybridized for 14 hours at 42°C in a solution 
containing 50% formamide, 5X SSC. 5X Denhardt's, 0.5% SDS, and 100 microgram per 
milliliter tRNA. The condition of the final wash was 0.1% SSC and 0.1% SDS at 60°C for 
fifteen minutes. 

EPSPS and GOX protein assays were conducted using crude protein extracts from leaf 
tissue of R 0 plants and total proteins were quantified following the method of Bradford (Anal. 
Biochem. 72:248-256, 1976). The percentage of EPSPS and GOX protein represented in the 
extracts was quantified using an ELISA method and calculated as percent total extractable 
protein. 

Immature embryos from the R 0 transgenic and Bobwhite control plants were isolated 
twenty days after anthesis and cultured on the MMS0 medium with 0.02 mM glyphosate for a 
germination test. Germinated and non-germinated embryos were recorded ten days later and the 
data was analyzed by % 2 test for 3:1 segregation. Tolerant plants from the germination test were 
transferred to soil and sprayed with three milliliters per liter of ROUNDUP as described above. 

Five plasmids harboring glyphosate resistance genes were used to transform immature 
wheat embryos as described above. pMON19338 contains a nucleotide cassette encoding a 
petunia EPSPS chloroplast transit peptide in frame with an Agrobacterium strain CP4 EPSPS 
enzyme sequence. The nucleotide cassette is inserted downstream of a cauliflower mosaic virus 
enhanced 35S promoter linked 3' to a maize HPS70 intron sequence and upstream of a nopaline 
synthase 3' transcription termination and polyadenylation sequence. Convenient restriction sites 
are positioned between the intron sequence and the 3' termination sequence for insertion of 
genetic elements. pMON19643 is identical to pMON19338 except that a GOX enzyme encoding 
sequence is used in place of the Agrobacterium EPSPS enzyme encoding sequence. 
pMON19574 is identical to pMON19338 but additionally contains a chloroplast targeted 
glyphosate-oxidoreductase expression cassette identical to that in pMON19643 downstream of 
and immediately adjacent to the EPSPS expression cassette. pMON32570 is similar to 
pMON19574 in that expression cassettes encoding a chloroplast targeted EPSPS and chloroplast 
targeted GOX are present, however, an expression cassette encoding a chloroplast targeted 
AMPA acyltransferase enzyme is also present between the EPSPS and GOX expression 
cassettes. Other elements which are present in pMON19574 and not in the other plasmids are 
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also worthy of mention. For example, a wheat major chlorophyll a/b binding protein gene 5' 
untranslated leader is present between the enhanced 35S promoter and intron in both the EPSPS 
and AMPA acyltransferase expression cassettes (McElroy et al., Plant Cell 2:163-171, 1990). 
Also, a wheat hspl7 gene 3' transcription termination and polyadenylation sequence is present in 
place of the nopaline synthase 3' sequence for both EPSPS and AMPA acyltransferase 
expression cassettes. All plasmids produced recombinant glyphosate tolerant wheat plants using 
the ballistic transformation method described above. However, plasmids which were capable of 
expressing GOX only or GOX along with an AMPA acyltransferase either did not produce 
recombinant glyphosate tolerant wheat plants or produced plants which experienced problems 
with stunted growth, aberrant segregation of phenotypes, and infertility and were not analyzed 
further. The data obtained after biolistic transformation using the described plasmids is shown in 
Table 18. 

Table 18. Wheat Biolistic Transformation Data 



Glyphosate Tolerance 
Gene(s) 



# Explants 



# Transgenic 
Events 



Transformation 
Efficiency 1 



GOX 



120 



0 



GOX + PhnO 



434 



EPSPS 



120 



_6_ 
6 



1.4 



5.0 



EPSPS + GOX 



120 



0.8 



EPSPS + PhnO + GOX 



10,068 



314 



3.1 



1 - transformation efficiency based on percentage of transgenic events identified from a 
total population of explants arising from a combination of experiments in which a 
particular vector construct has been bombarded into immature embryo's. 

Transformed glyphosate tolerant plants arising out of these transformations were self 
crossed and allowed to produce Rl seed, which were used to generate Rl plants. Glyphosate 
tolerance generally segregated in the expected ratio of 3:1 in Rl plants as judged by Rl plant 
sensitivities after spraying with glyphosate at the three leaf stage. Glyphosate tolerant Rl plants 
were self crossed and allowed to produce R2 seed. R2 seed was germinated from a number of 
different glyphosate tolerant lines to produce R2 glyphosate tolerant plants to which [ l4 C]- 
glyphosate was applied as described above. Plant leaf and stem tissues were harvested at 48 
hours after glyphosate application, and water soluble compounds were extracted as described 
above and analyzed by HPLC as in example 2 for the presence of [ 14 C]-glyphosate metabolites. 
The total area under the [ 14 C] isotope labeled peaks eluting from the column was summed to 
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provide a baseline of 100% [ l4 C]-compound identification for each sample analyzed. The results 
are shown in Table 1 9. 



Table 19, 



Glyphosate Metabolism In Wheat Plant Extracts 1 



Sample & Glyphosate 
Tolerance Gene(s) 


' Plant 

i lain . 

Line No 


r l4 ci- 

Glyphosate 


[ l4 C]- 
AMPA 


Acetyl- 
[ 14 C]-AMPA 


["*C]-Other J 


Standard 2 


lid 


! 30 


26 


31 


13 


lia 


29 


24 


29 


18 


llo 


35 


29 


36 


o 


Growth Medium 3 


WtX 




12 


0.2 


g 


no 
lid 


48 


25 


2 


25 


na 
lie* 


87 


7 


0 


6 


EPSPS 


24756 


43 


25 


1 


31 


24756 


53 


46 


0 


1 


25397 


61 


38 


0 


1 


25397 


37 


19 


1.2 


43 


25397 


64 


20 


0 


16 


EPSPS + PhnO + GOX 


27249 


6 


7 


85 


2 


27249 


14 


12 


61 


13 


27249 


5 


24 


33 


38 


25462 


48 


21 


0 


31 


25462 


44 


5 


0 


51 


25462 


54 


35 


0 


11 


26281 


48 


. 14 


17 


21 


26281 


64 


11 


13 


12 


26281 


38 


7 


7 


48 


28598 


20 


7 


5 


68 


28598 


25 


7 


5 


63 


Bobwhite 


na 


74 


26 


0 


0 


na 


17 


15 


0 


32 


na 


34 


24 


0 


42 



1 - [J mill uwut tAiiatia rrwv ouaijrtvu w; * l y rl 1 

1 , and the area under the plots for each peak were summed to provide a base of 1 00% [ C]- 
compound identification for each sample. 

2 - standard solution containing approximately equal [ 14 C] molar ratios of each known glyphosate 
metabolism related compound. 

3 - growth medium including [ u C]-glyphosate; glyphosate has previously been shown to be 
degraded by a photolytic process to AMPA, which can be autoacylated in the presence of certain 
acyl compounds (MSL-0598). 

4 - uncharacterized [ ,4 C]-labeled compounds which are resolved using the disclosed 
chromatographic method. Retention time of glyphosate is about 9.6 minutes, AMPA is about 5.4 
minutes, N-acetyl-AMPA is about 12.5 minutes, and the major [ 14 C]-labeled impurity in the 
[ ,4 C]-glyprnsate sample is about *.7 minutes. 
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The standard solution contains approximately equal molar ratios of each of the 
compounds glyphosate, AMPA and N-acetyl-AMPA, as well as a number of impurities which 
are present as a result of the chemical synthesis of these isotope labeled compounds. Growth 
medium to which [ i4 C]-glyphosate was added was treated to the same conditions as wheat plants, 
ie, the medium was exposed to incident light intensities which plants received. As expected, 
. photodegradation of glyphosate to AMPA was observed, and a small percentage of AMPA 
appeared to be converted to acetyl-AMPA, probably as a result of exposure in the growth 
medium to other acylated compounds. Photodegradation of glyphosate by visible light exposure 
to AMPA as the major degradation product has been observed previously (Lund-Hoeie et al., 
Photodegradation of the herbicide glyphosate in water. Bull. Environ. Contam. Toxicol. 
36:723-729, 1986). Recombinant wheat plants transformed with an EPSPS-only plasmid did not 
produce [ 14 C]-AMPA or acetyl- [ ,4 C]- AMPA from [ ,4 C]-glyphosate. [ ,4 C]-AMPA and trace 
amounts of acetyl-[ ,4 C]-AMPA which were observed were within the limits observed as a result 
of photodegradation in the growth medium control. Non-recombinant Bobwhite control plants 
treated with [ l4 C]-glyphosate also did not produce AMPA or acetyl-AMPA. Plants transformed 
with the triple gene construct plasmid containing genes capable of expressing EPSPS, PhnO and 
GOX produced variable results. About one third of these plants appeared to efficiently convert 
glyphosate to acetyl-AMPA, indicating that the GOX. and PhnO enzymes were present and 
functional. Southern blot analyses demonstrated that the transgenes were integrated into the 
wheat genomes and transmitted to the following generations. Western blot analysis using anti- 
EPSPS, anti-GOX, or anti-PhnO antiserum to detect these proteins in the triple gene transformed 
plant extracts provided further insight into the basis for the variable [ ,4 C]-glyphosate metabolism 
observation. Western blot analysis indicated that all of the lines were producing EPSPS, 
however only line 27249 was producing GOX and PhnO protein. This result is consistent with 
the data in Table 19, which shows that line 27249 efficiently metabolizes [ 14 C]-glyphosate to 
acetyl-[ 14 C]-AMPA. This plant line also did not demonstrate stunting, partial fertility, or altered 
segregation phenotypes associated with other lines. These results indicate that co-expression of 
GOX and AMPA acyltransferase in wheat plants expressing recombinant EPSPS provides 
improved herbicide tolerance. 



WO 00/29596 



PCI7US99/27152 



-113- 

EX AMPLE 9 

This example illustrates the transformation of tobacco chloroplasts with aphnO gene. 

Recombinant plants can be produced in which only the mitochondrial or chloroplast 
DNA has been altered to incorporate the molecules envisioned in this application. Promoters 
which function in chloroplasts have been known in the art (Hanley-Bowden et al., Trerfds in 
Biochemical Sciences 12:67-70, 1987). Methods and compositions for obtaining cells 
containing chloroplasts into which heterologous DNA has been inserted have been described, for 
example by Daniell et al. (U.S. Pat. No. 5,693,507; 1997) and Maliga et al. (U.S. Pat. No. 
5,451,513; 1995). A vector can be constructed which contains an expression cassette from 
which an acyltransferase protein could be produced. A cassette could contain a chloroplast 
operable promoter sequence driving expression of, for example, a phnO gene, constructed in 
much the same manner as other polynucleotides herein, using PCR methodologies, restriction 
endonuclease digestion, and ligation etc. A chloroplast expressible gene would provide a 
promoter and a 5' untranslated region from a heterologous gene or chloroplast gene such as 
psbA, which would provide for transcription and translation of a DNA sequence encoding an 
acyltransferase protein in the chloroplast; a DNA sequence encoding an acyltransferase protein; 
and a transcriptional and translational termination region such as a 3' inverted repeat region of a 
chloroplast gene that could stabilize an expressed mRNA coding for an acyltransferase protein. 
Expression from within the chloroplast would enhance gene product accumulation. A host cell 
containing chloroplasts or plastids can be transformed with the expression cassette and then the 
resulting cell containing the transformed chloroplasts can be grown to express the acyltransferase 
protein. A cassette may also include an antibiotic, herbicide tolerance, or other selectable marker 
gene in addition to the acyltransferase gene. The expression cassette may be flanked by DNA 
sequences obtained from a chloroplast DNA which would facilitate stable integration of the 
expression cassette into the chloroplast genome, particularly by homologous recombination. 
Alternatively, the expression cassette may not integrate, but by including an origin of replication 
obtained from a chloroplast DNA, would be capable of providing for replication of, for example, 
a heterologous phnO or other acyltransferase gene within the chloroplast. 

Plants can be generated from cells containing transformed chloroplasts and can then be 
grown to produce seeds, from which additional plants can be generated. Such transformation 
methods are advantageous over nuclear genome transformation, in particular where chloroplast 
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transformation is effected by integration into the chloroplast genome, because chloroplast genes 
in general are. maternally inherited. This provides environmentally "safer" transgenic plants, 
virtually eliminating the possibility of escapes into the environment. Furthermore, cnloroplasts 
can be transformed multiple times to produce functional chloroplast genomes wh :<i express 
multiple desired recombinant proteins, whereas nuclear genomic transformation-has been shown 
to be rather limited when multiple genes are desired. Segregational events are thus avoided 
using chloroplast or plastid transformation. Unlike plant nuclear genome expression, expression 
in chloroplasts or plastids can be initiated from only one promoter and continue through a 
polycistronic region to produce multiple peptides from a single mRNA. 

The expression cassette would be produced in much the same way that other plant 
transformation vectors are constructed. Plant chloroplast operable DNA sequences can be 
inserted into a bacterial plasmid and linked to DNA sequences expressing desired gene products, 
such as PhnO proteins or other similar acyltransferases, so that the acyltransferase protein is 
produced within the chloroplast, obviating the requirement for nuclear gene regulation, capping, 
splicing, or polyadenylation of nuclear regulated genes, or chloroplast or plastid targeting 
sequences. An expression cassette comprising uphnO or similar acyltransferase gene, which is 
either synthetically constructed or a native gene derived directly from an E. coli genome, would 
be inserted into a restriction site in a vector constructed for the purpose of chloroplast or plastid 
transformation. The cassette would be flanked upstream by a chloroplast or plastid functional 
promoter and downstream by a chloroplast or plastid functional transcription and translation 
termination sequence. The resulting cassette could be incorporated into the chloroplast or 
plastid genome using well known homologous recombination methods. 

Alternatively, chloroplast or plastid transformation could be obtained by using an 
autonomously replicating plasmid or other vector capable of propagation within the chloroplast 
or plastid. One means of effectuating this method would be to utilize a portion of the chloroplast 
or plastid genome required for chloroplast or plastid replication initiation as a means for 
maintaining the plasmid or vector in the transformed chloroplast or plastid. A sequence enabling 
stable replication of a chloroplast or plastid epigenetic element could easily be identified from 
random cloning of a chloroplast or plastid genome into a standard bacterial vector which also 
contains a chloroplast or plastid selectable marker gene, followed by transformation of 
chloroplasts or plastids and selection for transformed cells on an appropriate selection medium. 
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Introduction of an expression cassette as described herein into a chloroplast or plastid replicable 
epigenetic element would provide an effective means for localizing an acyltransferase gene and 
protein to the chloroplast or plastid. 

In view of the above, it will be seen that the several, advantages of the invention are 
achieved and other advantageous results attained. As various changes could be made fa the 
above methods and compositions without departing from the spirit and scope of the invention, it 
is intended that all matter contained in the above description, and shown in the accompanying 
drawings and sequences, shall be interpreted as illustrative and not in a limiting sense. 
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Claims: 

1 . A recombinant plant transformed with a polynucleotide sequence comprising: 

a) a plant functional promoter sequence operably linked to; 

b) a structural DNA sequence which encodes an acyltransferase enzyme, operably 
linked to; * 

c) a 3' sequence which functions in plants to cause transcription termination; 
wherein the promoter sequence causes sufficient expression of said enzyme in plant 

tissue to enhance the phosphonate herbicide tolerance of a plant transformed with 
said polynucleotide sequence, and wherein said enzyme transfers an acyl group 
from an acylated donor compound to the terminal amine of a phosphonate 
herbicide. 

2. The plant of claim 1 in which said acyltransferase enzyme is localized to plastids in said 
plant. 

3. The plant of claim 2 wherein said plastids comprise chloroplasts. 

4. The plant of claim 3 in which said structural DNA sequence comprises a 5' sequence 
encoding an amino-terminal chloroplast transit peptide operably linked 5' to said 
structural DNA sequence, wherein said enzyme is localized to the plant chloroplasts or 
plastids. 

5. The plant of claim 2, wherein said acylated donor is an acyl coenzyme A. 

6. The plant of claim 5 wherein said acyl coenzyme A is selected from the group consisting 
of acetyl coenzyme A, propionyl coenzyme A, malonyl coenzyme A, succinyl coenzyme 
A, and methyl-malonyl coenzyme A. 

7. The plant of claim 6 wherein said acyl coenzyme A is acetyl coenzyme A. 

8. The plant of claim 6 selected from the group consisting of corn, wheat, cotton, rice, 
soybean, sugarbeet, canola, flax, barley, oilseed rape, sunflower, potato, tobacco, tomato, 
alfalfa, lettuce, apple, poplar, pine, eucalyptus, acacia, poplar, sweetgum, radiata pine, 
loblolly pine, spruce, teak, alfalfa, clovers and other forage crops, turf grasses, oilpalm, 
sugarcane, banana, coffee, tea, cacao, apples, walnuts, almonds, grapes, peanuts, pulses, 
petunia, marigolds, vinca, begonias, geraniums, pansy, impatiens, oats, sorghum, and 
millet. 
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9. The plant of ciaim 2 in which the promoter sequence is derived from a plant DNA virus 
promoter sequence. 

10. The plant of claim 9 in which said promoter sequence is selected from the group 
consisting of CaMV35S, FMV35S, enhanced CaMVJSS, enhanced FMV35S, comalina 
yellow mottle virus promoter, and sugar cane bacilliform DNA virus promoter. , 

11. The plant of claim 10 in which the structural DNA sequence is or is complementary to an 
E. coliphnO gene sequence as set forth in SEQ ID NO:3, 

12. The plant of claim 1 1 wherein said structural DNA sequence encodes a peptide selected 
from the group consisting of SEQ ID NO:4 and SEQ ID NO:8. 

13. The plant of claim 12 in which the structural DNA sequence is derived from a microbe, 
wherein said sequence is or is complementary to a polynucleotide sequence capable of 
hybridizing to a sequence selected from the group consisting of SEQ ID NO:3, SEQ ID 
NO:7, SEQ ID NO:ll s SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, and SEQ ID 
NO: 19. 

14. The plant of claim 1 3 in which the acyltransferase enzyme is substantially similar to an E. 
coli PhnO peptide which functions in plants to transfer an acyl group from an acylated 
donor compound to the terminal amine of a phosphonate herbicide. 

15. The plant of claim 14 wherein said phosphonate, herbicide is selected from the group 
consisting of glyphosate and AMPA. 

16. The plant of claim 4 in which the 5' sequence encoding an amino-terminal chloroplast 
transit peptide is selected from the group consisting of SEQ ID NO:9, SEQ ID NO:ll, 
SEQ ID NO: 13, and SEQ ID NO: 14. 

17. The plant according to claim 3 which exhibits enhanced tolerance to one or more 
phosphonate herbicides, the herbicide being selected from the group consisting of 
glyphosate and AMPA. 

18. A seed produced from the plant of claim 17, wherein said seed comprises said 
polynucleotide sequence. 

1 9. A plant grown from the seed of claim 1 8. 

20. A stably transformed phosphonate herbicide tolerant recombinant plant which contains a 
polynucleotide sequence comprising: 

a) a plant functional promoter sequence operably linked to; 
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b) a structural DNA sequence which encodes an acyltransferase enzyme, operably 
linked to; 

c) a 3' sequence which functions in plants to cause transcription termination; 
wherein the promoter sequence is heterologous with respect to the structural DNA 

sequence and causes sufficient expression of said enzyme in plant tiss&e to 
enhance the phosphonate herbicide tolerance of a plant transformed with said 
polynucleotide sequence, wherein said enzyme transfers an acyl group from an 
acylated donor to the terminal amine of a phosphonate herbicide, and wherein 
said plant expresses a GOX gene which encodes a plant functional glyphosate 
oxidoreductase enzyme. 

21. A method for selectively enhancing phosphonate herbicide tolerance in a recombinant 
plant comprising the steps of: 

a) transforming said plant with a polynucleotide sequence comprising 

i) a promoter sequence which functions in plants to cause the production of 
an RNA sequence, operably linked to; 

ii) a structural DNA sequence capable of producing an RNA sequence which 
encodes an acyltransferase enzyme, operably linked to; 

iii) a 3' non-translated sequence which functions in plants to cause the 
addition of a polyadenylated nucleotide sequence to the 3 ' end of said 
RNA sequence; 

wherein the promoter sequence is heterologous with respect to the structural DNA 
sequence and causes sufficient expression of said enzyme in plant tissue to 
enhance the phosphonate herbicide tolerance of a plant transformed with 
said polynucleotide sequence, wherein said enzyme transfers an acyl 
group from an acylated donor substrate to the terminal amine of a 
phosphonate herbicide substrate, and wherein said plant expresses a GOX 
gene which encodes a plant functional glyphosate oxidoreductase enzyme; 
and 

b) expressing a herbicide tolerant effective amount of said acyltransferase enzyme in 
said plant. 
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22. The method according to claim 21 , wherein said acyitransferase is expressed from a DNA 
sequence selected from the group consisting of SEQ ID NO:3, SEQ ID NO:7, SEQ ID 
NO:li,andSEQ ID NO:19. 

23. A method for producing a genetically transformed phosphonate herbicide tolerant plant 
comprising the steps of: ' 

a) inserting into the genome of a plant cell a polynucleotide sequence comprising; 

i) a promoter sequence which functions in plant cells to cause the production 
of an RNA sequence, operably linked to; 

ii) a structural DNA sequence capable of producing an RNA sequence which 
encodes an acyl transferase enzyme which transfers an acyl group from an 
acylated donor to the terminal amine of a phosphonate herbicide substrate, 
operably linked to; 

iii) a 3' non-translated sequence which functions in plant cells to cause the 
addition of a polyadenylated nucleotide sequence to the 3' end of said 
RNA sequence; 
wherein the promoter sequence is heterologous with respect to the 

' structural DNA sequence and causes sufficient expression of said enzyme 
in a plant cell to enhance the phosphonate herbicide tolerance of a plant 
cell transformed with said polynucleotide sequence; 

b) selecting a transformed plant cell; and 

c) regenerating from the transformed plant cell a genetically transformed plant 
which exhibits improved phosphonate herbicide tolerance. 

24. A method for producing a genetically transformed phosphonate herbicide tolerant plant 
comprising the steps of: 

a) inserting into the genome of a plant cell a polynucleotide sequence comprising: 

i) a promoter sequence which functions in plant cells to cause the production 
of an RNA sequence, operably linked to; 

ii) a structural DNA sequence capable of producing an RNA sequence which 
encodes an acyltransferase enzyme which transfers an acyl group from an 
acylated donor to the terminal amine of a phosphorate herbicide substrate, 
operably linked to; 
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iii) a 3' non-translated sequence which functions in plant cells to cause the 
addition of a polyadenylated nucleotide sequence to the 3' end of said 
RNA sequence; 
wherein the promoter sequence is heterologous with respect to the 
structural DNA sequence and causes sufficient expression of said eiCzyme 
in a plant cell to enhance the phosphonate herbicide tolerance of a plant 
cell transformed with said polynucleotide sequence, and wherein said 
plant cell expresses a GOX gene which encodes a glyphosate 
oxidoreductase enzyme; 

b) selecting a transformed plant cell; and 

c) regenerating from the transformed plant cell a genetically transformed plant., 
which exhibits improved phosphonate herbicide tolerance. 

25. The method according to claim 24, wherein a transformed plant cell is selected by an 
ability to grow in the presence of a selective agent, wherein said agent is selected from 
the group consisting of glyphosate and AMPA. 

26. A peptide comprising an acyltransferase enzyme that catalyzes the transfer of an acyl 
group from an acylated donor to the terminal amine of a phosphonate herbicide. 

27. The peptide according to claim 26 wherein the acyltransferase enzyme is comprised of a 
sequence selected from the group consisting of SEQ ID NO:4, SEQ ID NO:8, SEQ ID 
NO:12,andSEQIDNO:20. 

28. The peptide according to claim 26 wherein said phosphonate herbicide is selected from 
the group consisting of glyphosate and AMPA. 

29. The peptide according to claim 26 wherein the acyltransferase enzyme is expressed in a 
cell from a DNA sequence which is or hybridizes to a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:3, SEQ ID NO:7, SEQ ID NO:l 1, and SEQ ID 
NO:19. 

30. The peptide according to claim 27 wherein the DNA sequence is derived from a microbe, 
said microbe being a member of the genus selected from the group consisting of 
Enterobacteriaceae, Streptomyces, Bacillus, Actinobacillus, Ascomycota, and 
Basidiomycota. 
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31. A method for selecting one or more cells transformed with a vector containing an 
acyltransferase gene encoding an enzyme which functions to N-acetylate a phosphonate 
herbicide compound comprising the steps of: 

a) transforming a population of cells with said vector; 

b) incubating said transformed cells in the presence of an inhibitory amoufct of 
phosphonate herbicide compound; 

c) identifying one or more cells that grow in the presence of said inhibitory amount 
of said compound; and 

d) isolating and purifying said one or more cells that grow in the presence of said 
inhibitory amount of said compound. 

32. A vector according to claim 31 comprising an acyltransferase gene encoding an enzyme 
which functions in said one or more cells to transfer an acyl group from an acylated 
donor substrate to the terminal amine of a phosphonate herbicide compound. 

33. The vector according to claim 3 1 wherein the gene is expressed in a host cell. 

34. The vector according to claim 33 wherein the host cell is not inhibited by the presence of 
a phosphonate herbicide in an amount which is inhibitory to a host cell lacking a 
functional acyltransferase gene encoding said enzyme. 

35. The host cell according to claim 34 selected from the group consisting of bacterial cells, 
fungal cells, animal cells, and plant cells. 

36. The host cell according to claim 35 wherein the bacterial cell is selected from the 
bacterial species consisting of Enterobacteriaceae, Mycobacteriaceae, Agrobacteriaceae, 
Actinobacteriaceae, Streptomyces, and Bacillus. 

37. The host cell according to claim 35 wherein the fungal cell is selected from the fungal 
species consisting of Ascomycota, Basidiomycota, and Deuteromycota. 

38. The host cell according to claim 35 wherein the plant cell is selected from the plant 
species consisting of Glycine max, Zea mays, Nicotania tabacum, Gossypium gossypia, 
Triticum aestivum, and Brassica napus. 

39. An antibody which binds to an acyltransferase protein sequence, wherein said 
acyltransferase is selected from the group consisting of SEQ ID NO:4, SEQ ID NO: 8, 
SEQ ID NO: 12, and SEQ ID NO:20. 

40. A method for identifying a recombinant acyltransferase gene in a sample comprising 
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a) pro v iding one or more distinct polynucleotide sequences capable of hybridizing to 
said gene; 

b) providing a reference sample comprising one or more polynucleotide sequences 
complementary to said distinct polynucleotide sequences; and 

c) providing instructions for combining said distinct sequences, said reference 
sample, and said recombinant acyltransferase gene in a sample, and 

d) detecting the recombinant gene in said sample. 

41. A kit for detecting the presence of a recombinant acyltransferase gene in a sample 
comprising 

a) providing one or more distinct polynucleotide sequences capable of hybridizing to 
said gene; 

b) providing a reference sample comprising one or more polynucleotide sequences 
complementary to said distinct polynucleotide sequences; and 

c) providing instructions for combining said distinct sequences, said reference 
sample, and said recombinant acyltransferase gene in a sample 

packaged together in a kit. 

42. A plant comprising a polynucleotide sequence containing a gene which encodes an 
acetyltransferase protein, wherein expression of said gene in said plant stimulates said 
plant's growth. 
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SEQUENCE LISTING 

<110> Barry, Gerard F. 

5 <120> Phosphonate Metabolizing Plants 

<130> 38-21(15303) PhnO 

<140> unknown 
10 <141> 1999-11-16 

<150> 60/108,763 
<151> 1998-11-17 

15 <160> 32 

<170> Patent In Ver. 2.0 

<210> 1 
20 <211> 15611 
<212> DNA 

<213> Escherichia coli 
<400> 1 

25 ggatccagca tcgacgccag tttttccacc attgtcagtc gcaggctaag cggcgcattt 60 
aacatgccgc cgttcgtcca tgtctgaagc tgcacacgcg aaagaagttc ctgcatcagt 12 0 
cgttcacgaa actgctgctg atgggcttgt ggaaggcggg catcatcgcc ctgcgccaga 180 

30 

tccactaaaa agcggggata aaccgactcc agcacgcgac cggggccgtc cagtaacgtc 24 0 
ttggtcaata tcgttctgcc gtgaaaagtg tttgaatatc atcgcgtaac agctgggcgt 30 0 
35 cggtgtaaat ccagccgtga gtcatcacag tctgctgcaa ttgctgctgc atcagcctga 360 
ccaccgattc attttgttga cgcagagcca ggctttcgcg taaacgcgtc tgtaattccg 420 
tcaaacatga agcgaactca gcgaaaaaag tattcatgcc tgccgtaaca gattcatcga 48 0 

40 

cctgctctgc cagaacttta gccatttgtt ggcaataaag atcgacttct gcgcttaatg 540 
ctcgttgcaa cacactgtaa tcaaccgttt ctgtcgggga tttctcattt ccccgtcccc 600 
45 agtcgggctg attcaaccag cgcgaaaaag tctcacgcac aacgcctaaa cgcgtgctct 660 
gctcgtccgt tgcatcctgg cgcgaaatga ctgcactgaa cagctggcga gtgttgaagt 72 0 
ggggaactac gccgtgaaaa acaggaaaat gaaacccagg acgaaaccct gactcgctca 780 

50 

attccatttt gacttgttgc tcaatggggc gaataacatc ggttaacact cggcaaaggg 84 0 
tggattccag ctcggcaaaa cgcagcgtaa agtcgcgact gatggtgttc tgcgccgtct 900 
55 gtaacagtgt ctcacagcgg gtacgcatct cgcttaacgg ctccgaatca tcctgaaaca 960 
aggcggctaa ctgcgcattc agcgcatctt gtt-ttgacg cagaaagtgg ttggrqgagg 1020 
tcagggccag ctcgatttca tgtttaatct cgccgctcac ctgcgcctga ttgagttgca 1080 
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atagctgcaa 


actttcttcg 


acctgatgga 


5 


ttaacccgtg 


cgcacgaaaa 


tccaggtatt 




tatgcgcagc 


agatcgcaaa 


gcatacaacg 




cctgaattgg 


ctgggcgaac 


agcgaatctt 


10 


at tccra crcrtc 


cgccagatcg 


gcatgacgcc 




cttccaccca 


qcqttqttgc 


tctggcgctg 


15 


acacccaatt 


cqccaggtag 


ccccacatcg 


tacagccttt 


catcagcgtc 


ccggaaatca 




tacgatcctg 


ttgatcgaac 


ttattgacca 


20 


ccaaaatcgc 


ctcacggacc 


tcttcatcgg 




ccgccagtac 


cqccqaqgcg 


cgtgccagct 


25 


gcccggcttc 


atttggcccg 


ggggtatcca 


gccccgccag 


atggacaaac 


tccacttcaa 




cagaaaaagg 


aaaatcgacg 


tccagcgcct 


30 


tcagacaatg 


aaaaataggc 


tgggcaccca 




cgatccgctg 


cataagcgca 


cgcatatctt 


35 


taatatcgca 


atcacgcagg 


cgctgttgta 


catgtgaaaa 


atgcagtacc 


ggttcctttt 




gcgcagtcat 


tgggcgatta 


cgattaggca 


40 


tggttgattt 


ccctgctttc 


atggtaccga 




ttttacgcaa 


ctcattattc 


agcatcgcgt 


45 


gtaaagccag 


ttgtggcgcg 


tcgtctccgg 


gcaactgttg 


gcgagaaagc 


tgtaacaggc 




tctgtgtgta 


catagaaaat 


tcttccttaa 


50 


tgtttttgag 


ttctgttttc 


ggcttttata 




gtgttgtttt 


ttatcagaat 


aaataacgtc 


55 


aaaat -itcat 


gtgaattaaa 


aaaagaacaa 


atagaggcaa 


aaagattatt 


ttctttttgc 



- 2 - 



tattttgccg 


caattgttca 


caagcgacgt 


114u 


cccgcgcctg 


ctgcgcgtaa 


ttcaacagtt 


1200 


aggcgttagc 


gtaagcggca 


tgaagcaacg 


1260 


cccacaactg 


atcggcagca 


tgacgaatat 


1320 


agcgcctgcc 


gagcgcggca 


tgggcaaaat 


1380 


gtaacttacc 


gttgttggct 


aactcatggc 


1440 


acgacaccgg 


aaatatctgc 


tgtggcgtaa 


1500 


gtgcccgcac 


ctggtcggcg 


tcgtcactgt 


1560 


gcacatacag 


cggcaccgat 


tgccccaccg 


1620 


agatcgattt 


cagttgcgta 


taatccagca 


1680 


gctggttaag 


cattttttgc 


agatgcggtt 


1740 


gtaacgtcaa 


ctgaccggga 


taactctcca 


1800 


tcacgggaat 


atgctcaatg gcggcgtaag 


1860 


tcgccagtcg 


cactaaatca 


ttcaaacttt 


1920 


gataatattt 


ttcgaaagcg acgccatttt 


1980 


tatctatttc 


cagcacatcg gtcagatgct 


2040 


attgttgaat 


taaafcaatcg 


attggcgcga 


2100 


gcccgggcgt 


atggcgaata 


agcgtcggca 


2160 


gaacctccgt 


accaacaatg gcattaatgg 


2220 


caattgcaag 


caccatttcc 


agtcgggaaa 


2280 


gacy tucgyt. 


gatattaggc 


tgactccagg 


2340 


gtacagagag 


aggcattttt 


tccagtaact 


2400 


gttcagcctc 


ctgacttaac 


tcatacaggg 


2460 


agcaaatttt 


gttattttac 


ttagccagat 


2520 


attactgcaa 


gaaataattt 


tatatttagt 


2580 


ttctgatacg 


tttaaaacgt 


cagaaagata 


2640 


gtagagcatt 


aacattatct 


taaataataa 


2700 


gtttcctttc 


aaatgaaaac gatcgtcgtc 


2760 
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taaaatcagc agtacccccg acaaactcag 
gcaatctgcc gttttttggc gcttttgccc 
taccacactg cccaaaatgc aactccgaat 
gcccggaatg tgcctacgaa tggaacgacg 
tcgttaaaga tgctaacggc aatctgctgg 
atctgaaggt gaaaggtagc tcttcgatgc 
gcctggttga aggcgaccat aacatcgatt 
tgaaatctga gtttgtgaaa aagaactgat 
ctcgcgccgc atccggcaac ggtgccagat 
gcctacaaat tcccgcaccc tccgtaggcc 
ggtgccgact gcctgatgcg acgcttgcgc 
tccgtaggcc ggataaggcg tttacgccgc 
acgcttgcgc gtcttatcag gcctacaaat 
tttacgccgc atccggcaat ggtgccgact 
gcctacaaat tcccgcaccc tccgtaggcc 
agtgccgact gcctgatgcg acgctcgcgc 
atttcgaact acacttaact ggcttctctt 
ctacctctct tttgccggta actgttccga 
cgcggaactg ctctataaaa tcagcttcgg 
cgagaactgc ccttccggaa tgcaatttcc 
cattgccgga agcgacatca tgatgagcga 
cggctttacg ctggtgctcg attcgcaaca 
tcttgccgct aacggaaaaa tcgaaatggc 
tggcaaagtc accgataaat ttggcgtacc 
aacgcaataa cccgccggga ggcccgccct 
ttcatctgtt agtcactttt aattaaccaa 
acttttttag ggaggctgca tcatgcaaac 
cttcaatcag catcaggcgc tgcatgcggt 
ggctctgctt gggccgtcgg gttccggcaa 



-3- 

ggattttgtg tataattgcg gccttcttcg 2820 
tgctgacttt tgaggaaatc cacatgtcat 2880 
acacttacga agataacggc atgtacatct 2940 
cagaacctgc acaggaaagc gac<j°?ctga 3000 
ctgacggcga cagcgttacc atcattaaag 3060 
tgaaaattgg caccaaagtg aaaaacatcc 3120 
gcaaaatcga cggttttggt ccgatgaaac 3180 
tgtattgtga tcggtaagcc ggataaggcg 3240 
gcctgatgcg acgcttgcgc gtcttatcag 3300 
ggataaggcg tttacgccgc atccggcaac 3360 
gtcttatcag gcctacaaat tcccgcaccc 3420 
atccggcaac agtgccaact gcctgatgcg 3480 
tcccgcaccc tccgtaggcc ggataaggcg 3540 
gcctgatgcg acgcttgcgc gtcttatcag 3600 
ggataaggcg tttacgccgc atccggcaac 3660 
gtcttatcag gccgcctctc atctgtataa 3720 
aactgaggtc accatcatgc cgttaagtcc 3780 
cgcgattgcc tattatcaac gtacgttggg 3840 
cgaaatgcca aaatcagcgc aggacagcgc 3900 
cgataccgcc atcgctcatg ccaacgtgcg 3960 
tgccatgccg tcaggaaaag ccagctactc 4020 
ggtcgaagaa ggaaaacgct ggtttgacaa 4080 
ctggcaggaa actttctggg cgcatggctt 4140 
gtggatgatt aatgtcgtca aacaacaacc 4200 
cccgcactgt catcgaattc ccgttaactc 4260 
atcgtcacaa taatccgcca cgatggagcc 4320 
gattatccgt gtcgagaagc tcgccaaaac 4380 
tgatctgaac attcatcacg gtgaaatggt 4440 
atccaccctt ttacgtcact taagcggttt 4500 
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gattaccggc 


gataaatccg 


ccggcagcca 


5 


cqaaqqccqt 


ctggcgcgcg 


atatcegcaa 


acaattcaac 


ctggtgaacc 


gectgagegt 




cacrcacqccQ 


ttctggcgca 


cctgttttag 


10 


Qttacaqqcq 


ctgacccgcg 


ttggcatrggt 




Ctccqqcqqa 


cagcagcagc 


gtgtggcgat 


15 


gattctggcc 


gatgaaccca 


tcgcctcgct 


caccctgcgc 


gacatcaatc 


agaacgaegg 




craattaccrcc 


ctgcgctact 


gegaaegcat 


20 




agecaacagt 


ttgataacga 






gagaacgega 


aagctgcctg 


25 


cgctaagata 


attgcctcgc 


tggccttcac 


cccggcacac 


accqaaqaqc 


aggaaaaggc 




acagcaaaac 


ctgaaaccgc 


aatggacgcc 


30 


co t era aaa t ci 


aacgccttct 


ttgccccaga 




caataaaata 


gatatcgect 


ggtacggcaa 


35 


caacggccag 


gtcttcgccc 


agaeggtege 


qttqatcqtc 

3 %3 Z7 


aacaaagata 


gtccgatcaa 




agatctcacc 


ttcggcaatg 


gcgatcctaa 


40 


ctacgtcttc 


gecaaaaaca 


atatctccgc 




gcatgaaacc 


aacgegctgg 


ccgtcgccaa 


45 


cgaaaacctc 


gacaagctga 


aaacctccgc 


ctggaaatcg 


ccgctgatcc 


caggcgatcc 




caccaaagac 


aagatctacg 


acttctttat 


50 


ggtgctggaa 


cgcctgggct 


gggcgccgtt 




gattcgecag 


ctcgcactgt 


ttaaagagat 


55 


tgagcaggac 


'agctggcaa 


aaac cege 


cctgaacaac 


gcgctaagcg 


cgatgagttc 



-4 - 



tatcgagctg 


ctgggccgca 


cagtccagcg 


4560 


aagccgcgcc 


aacaccggct 


acatcttcca 


4620 


actggagaac 


gtgctgattg gcgcgctcgg 


4680 


ctggtttacc 


cgegagcaga 


aacaacgcgc 


4740 


geattttgee 


catcaacgcg 


tttccaccct 


4800 


tgcccgcgcg 


ctgatgeage 


aggegaaggt 


4860 


ggacccggaa 


tccgcccgca 


tcgtgatgga 


4920 


catcaccgtg 


gtcgtcacgc 


tgcatcaggt 


4980 


cgtcgccctg 


cgccaggggc 


aegttttcta 


5040 


aegttttgae 


catctctacc 


gcagcattaa 


5100 


acatccccat 


cattgaggaa 


aacgaatgaa 


5160 


cagcatgttc 


agcctcagca 


ccctgttaag 


5220 


gctgaatttc 


ggcattattt 


caaeggaate 


5280 


attcttacag 


gatatggaga 


agaagctggg 


5340 


ctacgcaggc 


attatccagg gaatgegett 


5400 


cctgtcggca 


atggaagcgg 


tggatcgege 


5460 


ggcggatgga 


tegecaggtt 


actggagcgt 


5520 


caacctgaac 


gatctgetgg 


egaageggaa 


5580 


ctccacctct 


ggcttcctcg 


tccccggtta 


5640 


cagcgacttc 


aagcgcaccg 


tcaacgccgg 


5700 




gatgtggcga 


ccaacaacac 


5760 


geeggagaag 


ctgaaagaac 


tgaaagtgat 


5820 


gategtctgg 


egtaaaaate 


tttccgaaac 


5880 


gaattaegge 


aaaaegcegg 


aagagaaagc 


5940 


ccgcgcctcc 


agcgacctgc 


aactggtgcc 


6000 


geagggegtg 


aaaagcaata 


aaggactgaa 


6060 


gattcaggcg 


caactggatg 


acctggaccg 


6120 


ggtgagtaaa 


gcggtgcagt 


aaategtagg 


6180 
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tcggataaga cgccccggcg tcgcatccga 
gcaagcacct tatccggcct acagaccgga 
5 acccaagcgc a^jtggttct cgcttctgag 
ctcgtggcag ggcgcggaaa tggccccgct 
aaccttcgct gccgacttct tcccgcccga 

10 

t aatggccgtc acgctgcaaa tcgccgtctg 
cccctttggc ctgatgagcg ccgaaaacct 
15 ccgcctgatg gacgcctgcc gcgccattaa 
cgccgtcggt ctcggaccgt tcgctggcgt 
gctctccaag ctgctttccg aagcggtaga 

20 

tcgcgccacc ggtgccaaca agctcgaaga 
gccgctgctg atctcctact ccctctatcg 
25 cgtcggcatg gtcggcgcgg gcgggatcgg 
ccagttccaa caaacctgcg ccctgatggt 
tttcctctct caacggttgc gtaagcactt 

30 

cacttgtcta cacatccgac cagctaccca 
gagcaggagc ttcgtcaaca ctaccgctgc 
35 gcagcgcgct ttgaggtgaa tcgccacacc 
aaaggctggg tacagcgccg tcagggcgtc 
tacccgctca acgcccaggc gcgttttagc 

40 

accagcgaaa aactgctttc ggtattgcgc 
gggattaccg agggggagaa cgtcatccac 
45 gcgctctgtt taatcgacca ctacttcgcg 
ttcgacagcg gctcgctgca cgattttctg 
agccagacgc ggatcagcgc ccgccgcgcc 

50 

ccgaatatgt cgccgctgct gtgcgtgcgc 
ccggcggagt actccgtcag cctgacgcgc 
55 tgaatgcacg cagataccgc gacccgccag 
ccggctgaac tggcagcacg cctgaacgcg 



-5- 

caatgtgcag gcgttgacgc cggacgcggt 6240 
gccaaacatg caaaccatca ccatcgcccc 6300 
ctgggccgtt gttctcgccg tgctggtcgt 6360 
cacgctgatt aaagacggcg gcaacatggc 642 0 
tttcagccag cggcaggatt acctcaccga 6480 
gggcaccgcg ctggcggtgg ttctctccat 6540 
ggtgccgtgg tgggtttacc agcccgttcg 6600 
cgaaatggtc ttcgccatgc tgttcgtggt 6660 
gctggcgcta tttatccaca ccaccggcgt 6720 
agcaattgaa cctggcccgg tggaaggcat 6780 
gatcctctac ggcgtgctgc cgcaggtgat 6840 
cttcgaatcc aacgtccgct cggcgaccgt 6 900 
cgtcaccctg tgggaagcga ttcgcggttt 6960 
gcttatcatc gtcacggtca gcctgctgga 7020 
tatctgataa gcgaggcatt gatatctatg 7080 
acacgctatc aagagatagc cgcaaaactt 7140 
ggcgactatc ttcccgccga gcagcaactg 7200 
ctgcgccgcg ccatcgacca actggtggaa 7260 
ggcgtgctgg tgctgatgcg cccgttcgat 7320 
cagaatctgc tggatcaggg cagccatccc 7380 
cccgcgtccg gccacgtcgc tgacgcactg 7440 
ctgcgcaccc tgcgtcgggt caacggcgtc 7500 
gacctcaccc tctggccgac gctgcaacgc 7560 
cgcgagcaaa ccggaattgc gctgcgccgc 7620 
caggccaaag agtgccagcg tcttgaaatc 7680 
acccttaacc accgtgacgg tgaaagcagc 7740 
gccgacatga ttgaattcac tatggagcac 7800 
cactggatgt ccgtgctggc gca^agccaa 7860 
ctaaacatca ccgccgacta tgaggtgatc 7920 
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CQCQCCCTCtCI 

v*ys~ywwyv« w 


aaactggcct 


ggtacagatt 


caggegegga 


tgggeggcac 


eggegaaegt 


7980 


5 


ttttttgccg 


gcgacgccac 


gctgacccgc gccgccgtgc 


gectgactga 


cggcacgctc 


8040 




ooc fc a c aac t 


a acr t a c t ocrcr 
yyy *-y "-yyy 


gcgtgataaa 


cagcacgccg 


aacgctgcgc 


gctgattgac 


8100 




ococtoatcrc 
y ^y *■* ^*y y 


aacaatctccr 


ccactttcaa 


aacttatcag 


aaacccttat 


tgccccgctg 


8160 


10 


era p cr p 1* era r* p 


crt"at*cracaca 
y ^y y ^* a ^~y 


cattgccgca 


cgccaggccg 


aagtgaacgc 


cagccgggtc 


8220 




rrsj ct" t" r«t" t* t" A 
y c&i. llluuuo 


err pi f era 1 t CO 
^•yauyy i»w 


eggagacaac gcatgaccct 


aaaaaccact 


tttatgette 


8280 


15 


p pa fc cr p a aa a 


tocccaacac 


agttttcgtc gcctgttaaa 


aaccatcracfc 


Q a cr c cqq a c a 
y "y wv *yy y ^-z) 


8340 


fc era fc fc cr t pa p 


crtacatcaa 


ctcaaacgcg gctggcaacc 


gctgaatatc 


gccaccacca 


8400 




p'prrt'CJpfcCTPt" 


y cv w y v- m ^ w 


gataacgaca 


cgccggtgtg 


gctttctacc 


ccat taaata 


8460 


20 




a. cv i_ a. y ay u 


ctgcgttttc 


ataccaacgc 


occactcrcrtc 


agccagccgg 


8520 






pfc t* perpera t* a 
u l. uuy ^*yy y 


aeggatgagg 


cgatttccag 


cgaacagctc 


aacgcccttt 


8580 


25 




\-y i*i»y i»y v-y 


ceggaagegg 


gcgcgacgct 


gattttacag 


gtcgccagcc 


8640 


t" cracr r*Cf cr poct 
uyay <_yyt-yy 


acacatatta 




cr rrr rennet t~ a t~ 
y t-y v*y yy u ci u 


tgccgaagaa 


cgaatgatcg 


8700 




ctcccrcacrct 

w ^- w w y Ci. y w w 


crccaaaatiac 


attctgeacg 


aactcaccga 


gcgcccgcac 


ccgttcccgc 


8760 


30 


fccaacafccaa 


cctgatcctg 


acctgcggcg 


aacgcctgct 


ggctattccg 


cgaaccacgc 


8820 




a t cr t era* acrcr t 

*» wy '■"■y y ^y y \- 


gtgctgatgt 


acgttgccgt 


aaaaggggge 


gaaaaggega 


tcgacgccgc 


8880 


35 


ccacgccctg 


caagagagee 


gaegcegggg 


cgataccgat 


< 

ttgectgaac 


tgagegtege 


8940 


ccagat fcgaa 


cagcagct ta 


acctcgcggt 


agatcgegtg 


atgaccgaag 


gcggcattgc 


9000 




caacccrccraa 

w y v — - y ^y *** ^* 


ctcfocoacQC 


tggcgctgaa 


acaggccagc 


ggcgataacg 


ttgaagegat 


9060 


40 


fcfcfcppfcapfca 




gcaccacgtt 


ggcgaagctg 


gcggtaagcg agccgctcga 


9120 




caccaccggg 


atgegtcteg 


aacgeegtat 


ctccgccgtt 


tataaagaca 


ttcccggcgg 


9180 


45 


ccagctgctt 


ggcccaacct 


acgactacac 


ccatcgcctg 


ctcgatttta 


ccctgctggc 


9240 


aaacggcgaa 


gcgccgacgc 


tgaccaccgc 


egacagegaa 


caacagccgt 


cgccgcacgt 


9300 




tttcagcctg 


ctggcgcgtc 


aggggctggc 


gaagtttgaa 


gaggatagcg 


gcgcacagcc 


9360 


50 


ggatgacatc 


acccgcacgc 


cgccggttta 


cccctgctca 


cgttcttccc 


gtttgeagea 


9420 




gttgatgege 


ggcgacgaag 


getatttget 


ggcgctggcc 


tactccaccc 


agcgtggtta 


9480 


55 


eggacjeaat 


cacccgttcg 


egggegagat 


ccgcagtggt 


tacatcgacg 


tgtcgattgt 


9540 


geeggaagag 


ctgggatttg 


eggtaaaegt 


eggegaacta 


ctgatgaccg agtgtgaaat 


9600 



t 
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ggtcaacggt 


tttatcgacc 


cgccggatga 




ggtattcggc 


atgagcgagc 


gcaaagcgat 




ggctccggaa 


tacggcgagc 


acgcgacagg 




acatgccgac 


aacgtcgaag 


ccgcaggctt 


10 


cgatttccag 


gccgaactgg 


agctactcaa 


atctgagcgg 


ctacaacttt 


gcctacctcg 




ccatcttaaa 


agcggtggcg 


atccccggtt 


15 


cgatgccata 


cggctgggga 


accggcggca 




gcgacgtgct 


aaaggtgatt 


gaccagggtg 


20 


gcaacttctt 


taagcgcgtg 


accggggtaa 


ttatccagac 


gcgtcaccgc 


atccccgaaa 




tccaggtgcc 


aatcccggaa 


ccgctgcgct 


25 


ccatgcacgc 


gctggaagag 


tacggcgtga 




gcttcggtca 


tatcgccact 


acctacgcct 


30 


tggacccgtc 


gccgatcccg 


aaattcgata 


aactgttcgg 


cgcggggcgc gagaagcgca 




aaagtctcga 


tttcgacgat 


cacccgttca 


35 


tctgcggatc 


gacccacagc 


tatcttgatg 




tgtttgtctg 


ctccgatacc 


gattattgcc 


40 


tcaaccgtta 


ctttcggtca 


ataacctgac 


cgatgtctct 


tttgatttat 


ggccggggga 




cgggaagacc 


acgctgctga 


agtcgatctc 


45 


tcactacgag 


aaccgttcgc 


tgtatgcaat 




taccgaatgg 


ggcgtggtgc 


atcagcatcc 


50 


aggcggcaat 


atcggcgagc ggctgatggc 


tgccaccgcg 


cagaagtggc 


tggaagaggt 




gccgaccacc 


ttttccggcg gtatgcagca 


55 


gcatccgaag 


ctggtgttta 


tggatgaacc 




ccgcctgctc 


gacctgctgc gcggcctggt 
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gccgccgcac 


ttcacgcgcg 


gctacgggct 


9660 


ggcaatggcg 


ctggtcgatc 


gtgcgttgca 


9720 


cccggcgcag 


gatgaagagt 


ttgtgctggc 


9780 


tgtctcgcac 


ctcaaactcc 


cccactacgt 


9840 


acgtctgcaa 


caggagaaga 


accatggcta 


9900 


acgagcagac 


caaacgcatg 


atccgccgcg 


9960 


atcaggtgcc 


gtttggcggg 


cgcgagatgc 


10020 


tacagctcac 


cgccagcgtg 


attggcgaaa 


10080 


cggatgacac 


caccaacgcc 


gtgtcgattc 


10140 


acaccactga 


acgtacggac 


gatgcgacgc 


10200 


cgccgctgac 


cgaagatcag 


atcattatct 


10260 


ttatcgagcc 


gcgcgaaacg 


gaaacccgca 


10320 


tgcaggtgaa 


actgtatgaa 


gatatcgccc 


10380 


atccggtgaa 


ggtgaacggg 


cgctacgtaa 


10440 


acccaaaaat 


ggacatgatg 


cccgccctgc 


10500 


tctatgcggt 


gccgccgttt 


acccgcgtgg 


10560 


ccgttcagca 


gtgggatgag 


ccatgcgcca 


10620 


aagtggtgct 


ggatgacgcc 


ggaaaccgca 


10680 


gccaacagag 


cgaggcaaaa 


aaccaatgaa 


10740 


ccacctttac 


gcgccgggca 


aaggctttag 


10800 


agtgctgggc 


attgtcgggg 


aatccggctc 


10860 


cgcgcgcctg 


acgccgcagc 


agggggaaat 


10920 


gagcgaggcc 


gaccgccgtc gcctgctgcg 


10980 


actcgacggc 


ctgcgccgcc 


aggtgtcggc 


11040 


gaccggggca 


cgtcattacg gcgatattcg 


11100 


ggagattccc 


gccaaccgga 


tcgacgacct 


11160 


gcgtttgcag 


attgcccgca 


acctggtgac 


11220 


gaccggcggg 


ctggatgtgt 


cggtgcaggc 


11280 


ggtggagctg 


aacctcgcgg 


tggtgattgt 


11340 
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cacccatcrat 


ttaggcgtcg cccgcctgct ggcggaccgt ttgctggcga tgaagcaggg 


11400 




ocaa a fccicrtcr 


gagagtgggt taaccgaccg cgtgctcgac gacccgcatc atccgtatac 


11460 


5 


acagc tgctg 


gtgtcatcgg ttttgcagaa ttgagccggt gccggatgcg gcgtaaacgc 


11520 






ctacaaatgc gctccccgta ggtcggataa gacgcgtcag cgtcgcatcc 


11580 


10 




ccacaaqqcq aaaaatgatt aacgtacaaa acgtcagtaa aaccttcatc 


11640 




uy ca^L c*y 


aaaacqqcqt gcqcctgccc gtcctcaatc gcgcctcgct caccgtcaac 


11700 


15 


rrr , rrrrar , CTA A t" 


qcatqqtqct ccacggccat tccggcagcg gcaaatcaac tctgctacgc 


11760 


t. cy (_ uy uctt^-y 


ccaactatct acccqacqaa ggtcaaatcc agatcaaaca cggtgacgag 


11820 




^*T<T<Tt" ACTA rr 


tggtcaccgc gccagcgcgc aaagtggtgg aaatccgcaa aaccaccgtc 


11880 


20 


ggccgyy uyci 


accaatttct qcqcqtcatc ccgcgtatct cagcactgga agtggtgatg 


11940 






tcqataccqq cgttccgcgt gaagcctgcg ccgctaaagc cgcgcgtctt 


12000 


25 


ctcacccgcc 


tgaacgtgcc ggaacgcctg tggcacctgg caccatcgac attttccggt 


12060 


ggcgaacagc 


agcgcgtcaa catcgcccgc ggctttatcg tcgactaccc cattctgctg 


12120 




cttgacgaac 


ctaccgcctc gctggacgcc aaaaacagcg ccgcggtggt ggaactgatt 


12180 


30 


cgcgaagcca 


aaacccgtgg cgcagccatc gtaggcatct tccatgacga agctgtacgt 


12240 




aatgacgtcg 


ccgaccgcct gcacccaatg ggagcctctt catgattatc aataacgtta 


12300 


35 


agctggtgct 


ggaaaacgag gtggtaagcg gttcgctgga ggtgcagaac ggcgaaatcc 


12360 


gcgcctttgc 


cgaaagccag agccgcctgc cggaggcgat ggacggcgaa ggcggctggc 


12420 




tgctgccggg gctgattgag ctgcataccg ataatctgga taaattcttc accccgcgcc 


12480 


40 


cgaaagttga 


ctggcctgcc cactcggcga tgagcagcca cgacgcgctg atggtggcga 


12540 




gcggcatcac 


caccgtactg gatgccgtgg caattggcga cgtgcgcgac ggcggcgatc 


12600 


45 


ggctggagaa 


tctggagaag atgatcaacg ccatcgaaga gacgcagaaa cgcggcgtca 


12660 


accgcgccga gcaccgtctg catctgcgct gcgaactgcc gcatcacacc acgctgccgc 


12720 




tgtttgaaaa 


actggtgcag cgcgagccgg tgacgctggt gtcgctgatg gaccactcgc 


12780 


50 


cgggccagcg 


ccagttcgcc aaccgcgaga agtatcgcga atattatcag ggcaaatact 


12840 




ccctcactga 


tgcgcagatg cagcagtacg aagaagagca actggcgccc gccgcacgct 


12900 


55 


ggtcgcagcc 


yaatcgcgaa tccat gccg ccctgtgccg cgcgcgaaaa attgcgcttg 


12960 


ccagccacga 


tgacgccacc cacgcccacg ttgctgaatc tcaccagctt ggcagcgtga 


13020 
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tcgccga»tt tcccacoacg tccgaagcgg cggaagcctc gcgcaagcat ggcatgaacg 13080 
tgctgatggg cgcgccgaat attgtgcgcg gcggctcgca ctccggcaac gtggcggcca 13140 
gtgaactggc gcagcttggc ctgctggata tcctctcttc cgactactac cccgccagcc 13200 
tgctcgatgc g^catttcgc gtcgccgatg acgagagcaa ccgctttacg ctgccgcagg 13250 
cggtgaagct ggtgactaaa aatccagcgc aggcgcttaa tctccaggat cgcggggoga 13320 , 
ttggcgaggg caaacgcgcc gacctggtgc tggcgcatcg caaggacaat catattcata 13380 
tcgaccacgt ctggcgtcag ggtaaaaggg tgttctgatg atgggaaaac cgatttggtt 13440 
aatggggccg tccggctccg ggaaagacag cctgctggcg gaactccgcc tgcgggaaca 13500 
aactcagtta ctggtggcgc atcgccacat cacgcgcgat gccagcgccg gaagtgaaaa 13560 
ccatatcgcc ctgagcgagc aggagttttt tacccgcgcg gggcaaaatc tgttggcctt 13620 
aagctggcac gctaacggtc tgtattatgg cgtcggcgtc gagattgatc tctggctgca 136 80 
cgccggattc gacgtgctgg tcaacggctc acgcgcccat ctgccgcagg cgcgggcgcg 13740 
ctatcaatcg gcgctgctgc ccgtctgttt acaggtttcg ccggagatcc tccgccagcg 13800 
cctggaaaac cgtggccgtg aaaacgccag tgaaaCtaac gcccgcctgg cgcgcgccgc 13 860 
ccgctatact ccacaggatt gccatacgct caacaatgac ggcagcctgc gccagtcggt 13920 
cgacacgctg ctgacgctga tccatcagaa ggagaaacac catgcctgct tgtgagcttc 13980 
gcccggccac gcagtacgac accgacgcgg tttacgcgct efatttgtgag ctaaaacagg 14040 
cggagtttga ccaccacgcg tttcgcgtgg gttttaacgc caatctgcgc gacccaaaca 14100 
tgcgctacca tctggcgctg cttgatggcg aagttgtcgg catgatcggc ctgcatttgc 14160 
agtttcatct gcatcatgtc aactggatcg gcgaaattca ggagttggtg gtaacgccgc 14220 
aggcgcgcgg tctgaacgtc ggcagtaagt tactggcgtg ggcagaagaa gaagcccgcc 14280 
aggccggggc cgaaatgacc gaactttcga ccaacgtgaa gcgccacgac gcgcaccgtt 14340 
tctatctgcg cgaaggctac gagcagagcc acttccgctt caccaaggcg ctgtaacatg 14400 
agcctgaccc tcacgctcac cggcaccggc ggcgcacagg gcgttccggc atggggctgc 14460 
gagtgtgcgg cctgcgccag agcgcggcgc tcgccgcagt atcgccgcca accgtgcagc 14 520 
ggcgtagtga agtttaacga cgcaatcacc ccgatcgacg ccgggctgca cgatctcgcc 14 580 
gatcgctggt cgcccggatc gttccagcag tttttgctga cgcattatca tatggatcac 14640 
gtccaggggc tgtttccgct gcgctggggc gctggcgatc cgatcccggt ttacggcccg 14700 
ccggatgaac agggctgcga cgatctgttt aaacatccgg gcctgcttga tttcagccac 14760 
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acggtggaac cgtttgtggt gtttgatttg caggggttac aggtcacgcc cctgccgctc 14d20 

aaccactcaa aactgacctt cggttatctg ctggaaacgg cacacagccg ggtggcgtgg 14880 

ctgtctgaca ccgcaggctt gccggaaaaa acgctgaaat ttttacgcaa taatcagccg 14 94 0 

caggtaatgg tgatggattg cagtcacccg ccgcgcgcgg atgcaccgcg taatcactgt 15000 

10 gatttaaata ccgtgcttgc gctgaatcag gttatccgct cgccacgggt gattctgacc 15060 

catatcagcc accagtttga tgcgtggctg atggaaaacg cactaccgtc agggtttgag 15120 

gtggggtttg atgggatgga gattggggtg gcgtgatgag agggaatgtg cgcgctggcc 15180 

ccctcaccct aaccctctcc ccagaggggc gaggggaccg attgtgctcg atattgaata 15240 

ttgcgcccgt tttctccctc tccccattgg ggtgaggggc gatgcctgct ccatacccaa 15300 

20 cctcatcgcc catactcatc ttccattctc cgctcttcat cctccagttg ccgacgctcc 15360 

tgatcaagct ggcgctggcg atcgtccagc tgcctgcggc gatcttcaaa ctggcggcgg 15420 

cggtcgtcat attgtctgcg ccgatcgtcg ctcacttcac gctgccagcc gtggtcgcgc 15480 

" gaatcttcat agttgaagcg gcgcacgaaa aacgcgaaag cgtttcacga taaatgcgaa 15540 

aactttagct ttcgcgcttc aaatgaaaca gatgtattaa ttactgcttt ttattcatta 15600 

30 catggggatc c , 15611 

<210> 2 , 
<211> 11672 
35 <212> DNA 

<213> Escherichia coli 

<400> 2 

gaattcccgt taactcttca tctgttagtc acttttaatt aaccaaatcg tcacaataat 60 

40 

ccgccacgat ggagccactt ttttagggag gctgcatcat gcaaacgatt atccgtgtcg 120 
agaagctcgc caaaaccttc aatcagcatc aggcgctgca tgcggttgat ctgaacattc 180 

45 atcacggtga aatggtggct ctgcttgggc cgtcgggttc cggaaaatcc acccttttac 240 
gtcacttaag cggtttgatt accggcgata aatctgtcgg tagccatatc gagctgctgg 3 00 
gccgcacagt ccagcgcgaa ggccgcctgg cccgcgatat ccgcaaaagc cgcgcccata 360 

50 ccggctacat attccaacaa ttcaacctgg tgaaccgcct gagcgtactg gagaacgtgc 420 
tgattggcgc gctcggcagc acgccgttct ggcgcacctg ttttagctgg ttcaccggcg 480 

55 agcagaaaca gcgcgcgtta caggcgctga cccgcgttgg catggtgcat tttgcccatc 540 
agcgcgtttc caccctctcc ggcggccagc agcaacgtgt ggcgattgcc cgtgcgctga 600 
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tgcagcaggc gaaagcgatc ccggccgatg aacccaccgc cccgccggac ccagaatcag 660 
cgcgcatcg^ gatggacacc ctgcgcgaca tcaaccagaa cgacggcacc accgcggtcg 720 
tcacgctgca tcaggtggat tacgccctgc gctactgcga acgcatcgtc gccctgcgcc 780 
aggggcacgt cttctacgac ggcagcagcc aacagtttga taacgaacgt tttgaccacc 840 

io tctaccgcag cattaaccgc gtcgaagaga acgcgaaagc tgcctgacac ccccatcatc 900 
' gaggaaaacg aatgaacgct aagataactg cctcgotggc cttcaccagc atgttcagcc 960 
tcagcaccct gttaagcccg gcgcacgccg aagagcagga aaaggcgttg aatttcggca 1020 

15 ttatttcaac ggaatcacag caaaacctga aaccgcaatg gacgccgttc ttgcaggata 1080 
tggagaagaa gctgggcgtg aaggtcaacg ccttctttgc cccggactac gcgggcatta 1140 

20 tccaggggac gcgcttcaat aaagtggata tcgcctggta cggcaatctg ccggcgatgg 1200 
aagcggtgga tcgcgccaat ggccaggtct tcgcccagac ggtcgcggcg gatggatcgc 1260 
cgggttactg gagcgtgttg atcgtcaaca aagacagtcc gatcaacaac ctgaacgatc 1320 

25 tgctggcgaa gcggaaagat ctcacctttg gcaatggcga tcctaactcc acctctggct 1380 
tcctcgtccc cggctactac gtcttcgcca aaaacaatat ctocgccagc gacttcaagc 1440 

30 gcaccgtcaa cgccgggcat gaaaccaacg cgctggccgt cgccaacaag caggtggatg 1500 
ttgccaccaa caacaccgaa aacctcgaca agctgaaaac ctccgcgcca gagaagctga 1560 
aagaactgaa ggtgatctgg aagtcgccgc tgatcccagg cgatccgatc gtctggcgca 1620 

35 agaatctttc cgaaaccacc aaagacaaga tctacgactt ctttatgaac tacggcaaaa 1680 
cgccggaaga aaaagcggtg ctggaacgcc tgggctgggc gccattccgc gcttccagcg 1740 

40 acctgcaact ggtgccgatt cgccagctcg cgctgtttaa agagatgcag ggcgtgaaaa 1800 
gcaataaagg actgaatgag caggacaagc tggcaaaaac caccgagatt caggcgcagc 1860 
tggatgacct ggaccgcctg aacaacgcgc taagcgcgat gagttcggtg agtaaagcgg 1920 

45 

tgcagtaaat cgtaggtcgg ataagacgcc ccggcgCcgc atccgacaat gtgcaggcgt 1980 
tgatgccgga tgcggtgcaa gcaccttatc cggcctacag accggagcca aacatgcaaa 2040 

50 ccatcaccat cgccccaccc aagcgcagct ggttctcgct tctgagctgg gccgttgtac 2100 
tcgccgtgtt ggtcgtctcg tggcagggcg cggaaatggc cccgcttacg ctgatcaaag 2160 
acggcggcaa catggcgacg ttcgccgccg acttcttccc gcccgatttc agccagtggc 2220 

" aggattacct caccgaaatg gccgtcacgc tgcaaatcgc cgtctggggc accgcgctgg 2280 
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cggtggttct ctccatcccc tttggcctga tgagcgccga aaacctggtg ccgtggcggg 2340 
tttaccagcc cgttcgccgc ctgatggacg cctgccgcgc cattaacgaa atggtcttcg 2400 
ccacgctgtt cgtggtcgcc gtcggcctcg gcccgttcgc tggcgtgccg gcgtgctggc 2460 
gctgtttatc cacaccaccg gcgtgctctc caagctgctt tccgaagcgg cggaagcgat 2520 
tgagcccggc ccggtggaag gcattcgcgc caccggtgcc aacaagctcg aagagacoct 2580 
• ctacggcgtg ctgccacagg tgatgccact gctgatctcc tactcccCct atcgcttcga 2640 
atccaacgcc cgctcggcga ccgtcgtcgg catggtcggc gcaggcggga tcggcgtcac 2700 
cctgtgggaa gcgattcgcg gtttcoagtt ccaacaaacc tgcgccctga tggtgcttat 2760 
catcgtcacg gtcagcctgc tggatttcct ctctcaacgg ttgcgtaagc actttatctg 2620 
ataagcgagg cattgatatc tatgcacttg tctacacatc cgaccagcta cccaacacgc 2880 
tatcaagaga tagccgcaaa acttgagcag gagctccgtc aacactaccg ctgcggcgac 2940 
tatcttcccg ccgagcagca actggcagcg cgctttgagg tgaatcgcca caccctgcgc 3000 
cgcgccatcg accaactggt ggaaaaaggc tgggtacagc gccgtcaggg cgtcggcgcg 3060 
ctggtgctga tgcgcccgtt cgattacccg ctcaacgccc aggcgcgttt tagccagaat 3120 
ctgctggatc agggcagcca tcccaccagc gaaaaactgc tttcggtatt gcgccccgcg 3180 
tccggccacg tcgctgacgc actggggatt accgaggggg agaacgtcat ccacctgcgc 3240 
accctgcgtc gtgtcaacgg cgtcgcgctc tgtttaatcg accactactt cgcggacctc 3300 
accctotggc cgacgctgca acgcttcgac agcggctcgc tgcacgattt tctgcgcgag 3360 
caaaccggaa ttgcgctgcg ccgcagocag acgcggatca gcgcccgccg cgcccaggcc 3420 
aaagagtgcc agcgtcttga aatcccgaat atgtcgccgc tgctgtgcgt gcgcaccctt 3480 
aaccaccgtg acggtgaaag cagcocggcg gagtactccg tcagcctgac gcgcgccgac 3540 
acgattgaat tcactatgga gcactgaatg cacgcagata ccgcgacccg ccagcactgg 3600 
atgtccgtgc tggcgcacag ccaaccggct gaactggcag cacgcctgaa cgcgctaaac 3660 
atcacogccg actatgaggt gatccgcgcc gctgaaaccg gcctggcaca gattcaggcg 3720 
cggatgggcg gcaccggcga acgttttttt gccggcgacg ccacgctgac ccgcgccgcc 3780 
gtgcgcctga ctgacggcac gctcggctac agctgggtgc aggggcgcga taaacagcac 3840 
gccgaacgct gcgcgctgat tgacgcgctg atgcagcaac ctcgccactt tcaaaactta 3900 
tcagaaaccc ttattgcccc gctggacgct gaccgtatgg cacgcattgc cgcacgccag 3960 
gccgaagtga acgccagccg ggtcgacttc tttacgatgg ttcgcggaga caaogcatga 4020 
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ccctggaaac cgcttttatg ctucccgtgc 
taaaggccat gagcgagccg ggcgtgattg 

5 

aaccgctgaa tatcgccacc accagcgtgc 
tgtggctttc taccccatta aacaacgata 
10 acgcgccgct ggtcagccag ccggaacagg 
ccagcgaaca gctcaacgcc ctttccaccg 
cgctgatttt acaggtcgcc agcctgagcg 

15 

gtattgccga agaacgaatg atcgctccgc 
ccgagcgccc gcatccgttc ccgctcggca 
20 tgctggctat tccgcgaacc actcatgtgg 
gggcgagaag gcgatcgacg ccgcccacgc 
cgatttgccc gaactgagcg tcgcccagat 

25 

cgtgatgacc gaaggcggca ttgccgaccg 
cagcggcgat aacgttgaag cgattttcct 
30 gctggcggta agcgagccgc tcgacaccac 
cgtttataaa gacattcccg gcggccagct 
cctgctcgat tttaccctgc tggcaaacgg 

35 

cgaacagcag ccgtcgccgc acgttttcag 
tgaagaggat agcggcgcac agccggatga 
40 ctcacgctcc tcccgtttgc agcagttgat 
ggcctactcc acccaacgcg gttacgggcg 
cggctatatc gacgtgtcga ttgtgccgga 

45 

actgctgatg actgagtgtg aaatggttaa 
gcacttcacg cgcggctacg ggctggtgtt 
50 ggcgctggtc gaccgcgctc tgcaagcccc 
gcaggatgaa gagttcgtgc tggcacatgc 
acacctcaaa ctcccccact acgtcgattt 

55 

gcaacaggag cagaaccatg gctaatctga 



aggatgccca 


gcacagtttt 


-gtcgcctgt 


4080 


tcgccctgca 


ccagctcaaa 


cgcggccggc 


4140 


tgctgacgcc 


ggccgataac 


gacacgccgg 


4200 


tcgtcaacca 


gagcctgcgt 


tttcatacca 


4260 


cgaccttcgc 


ggtgacggat 


gaggcgattt 


4320 


gcaccgccgt 


tgcgccggaa 


gcgggtgcga 


4380 


gcggacgcat 


gttgcgcctt 


actggtgcgg 


4440 


agctgccgga gtgcattctg 


cacgaactca 


4500 


tcgacctgat 


cctgacctgt 


ggcgagcgcc 


4560 


aggtgtgctg 


atgtacgttg 


ccgtgaaagg 


4620 


cctgcaagag 


agccgacgcc 


gaggcgatac 


4680 


tgaacagcag 


cttaacctcg 


cggtagatcg 


4740 


cgaactggcg 


gcgctggcgc 


tgaaacaggc 


4800 


gctgcgcgcc 


taccgcacca 


cgttggcgaa 


4860 


cgggatgcgt 


ctcgaacgcc 


gtatctccgc 


4920 


gcttggccca acctacgact 


acacccatcg 


4980 


cgaagcgccg 


acgctgacca 


ccgccgacag 


5040 


cctgctggcg 


cgtcaggggc 


tggcgaagtt 


5100 


catcacccgc 


acgccgccgg 


tttacccctg 


5160 


gcgcggcgac 


gaaggctatt 


tgctggcgct 


5220 


caatcacccg 


ttcgcaggcg 


agauccgcag 


COQA 
J Z D U 


agagctggga 


tttgcggcga 


acgtcggcga 


5340 


cggttttatc gacccgccgg 


gtgagccgcc 


5400 


cggcatgagc 


gagcgcaaag 


cgatggcgat 


5460 


ggagtacggc 


gagcacgcga 


caggcccggc 


5520 


cgacaacgtc gaagccgcag 


gctttgtctc 


5580 


ccaggccgaa 


ctggagctac 


tcaaacgtct 


5640 


gcggctacaa 


ctttgcctac 


ctcgacgagc 


5700 
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agaccaaacg 


catgatccgc 




tgccgtttgg 


cgggcgcgag 


5 


ttaccgccag 


cgcgattggc 




acaccaccaa 


cgccgtgtcg 


10 


cggaacgtac 


ggacgatgcg 


■ tgaccgaaga 


tcagataatt 




agccgcgcga 


aacggaaacc 


15 


tgaaactgta 


tgaagatatc 




tgaaggtaaa 


tgggcgccac 


20 


aaatggacat 
cggtgccgcc 


gacgcccgcc 
gtttacccgc 




agcagtggga 


tgagccatgc 


25 


tgctggatga 


cgccggaaac 




agagcgaggc 


aaaaaaccaa 


30 


ttacgcgccg 
gggcattgtc 


ggcaaaggct 
ggggaatccg 




cctgacgccg 


cagcaggggg 


35 


ggccgaccgc 


cgtcgcctgc 




cggcctgcgc 


cgccaggtgt 


40 


ggcacgtcat 
tcccgccaac 


tacggcgata 
cggatcgacg 




gcagattgcc 


cgcaacctgg 


45 


cgggctggat 


gtgtcggtgc 




gctgaacctc 


gcggtggtga 


50 


ccgtttgctg 
cgacgacccg 


gtgatgaagc 
catcatccgt 




cggtgccgga 


tgcggcgtaa 


55 


ataagacgcg 


tcagcgtcgc 




caaaacgtca 


gtaaaacctt 



- 14- 

cgcgccatct taaaagcggt 
atgccgatgc cgtacggctg 
gaaagcgacg tgctgaaggc 
attcgcaact tcttcaagcg 
acggttatcc agacgcgtca 
atcttccagg tgccaatccc 
cgcaccatgc acgcgctgga 
gcccgcttcg gtcatatcgc 
gtgatggacc cgtcgccgat 
ctgcaactgt tcggcgcggg 
gtggaaagtc tcgatttcga 
gccatctgcg gatcgaccca 
cgcatgtttg tctgctccga 
tgaatcaacc gttactttcg 
ttagcgatgt ctcttttgat 
gctccgggaa gaccacgctg 
aaattcacta cgagaaccgt 
tgcgtaccga atggggcgtg 
cggcaggcgg caatatcggc 
ttcgtgccac cgcgcagaag 
acctgccgac caccttttcc 
tgacgcatcc gaagctggtg 
aggcccgcct gctcgacctg 
ttgtcaccca tgatttaggc 
aggggcaagt ggtggagagt 
atacacagct gctggtgtca 
acgccttatc cggcctacaa 
atccgacacc cgaaccacga 
catcctgcac cagcaaaacg 



ggcgaccccc 


ggttatcagg 


5760 


gggaaccggc 


ggcattcagc 


5820 


gattgaccag 


ggcgcggatg 


5880 


cgtgaccggg 


gtaaacacca 


5940 


ccgcatcccc 


gaaacgccgc 


6000 


cgagccgctg 


cgctttatcg 


6060 


agagtacggc 


gtgatgcagg 


6120 


caccacctac 


gcctatccgg 


6180 


cccgaaattc 


gataacccaa 


6240 


gcgcgagaag 


cgcatctatg 


6300 


cgatcacccg 


ttcaccgttc 


6360 


cagctatctt 


gatgaagtgg 


6420 


taccgattat 


tgccgccaac 


6480 


gtcaataacc 


tgacccacct 


6540 


ttatggccgg 


gggaagtgct 


6600 


ctgaagtcga 


tctccgcgcg 


6660 


tcgctgtatg 


caatgagcga 


6720 


gtgcatcagc 


atccactcga 


6780 


gagcggctga 


tggcgaccgg 


6840 


tggctggaag 


aggtggagat 


6900 


ggcggtatgc 


agcagcgttt 


6960 


tttatggatg 


aaccgaccgg 


7020 


ctgcgcggcc 


tggtggtgga 


7080 


gtcgcccgcc 


tgctggcgga 


7140 


gggttaaccg 


accgcgtgct 


7200 


tcggttttgc 


agaattgagc 


7260 


atgcgctccc 


cgtaggtcgg 


7320 


ggcgaaaaat 


gattaacgta 


7380 


gcgtgcgcct 


gcccgtcctc 


7440 
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aatcgcgcct cgctcascgt caacgcgggc gaatgcgtgg tgctccacgg ccaccccggc 7500 
agcggcaaat^caactctgct acgctcgctg cacgccaacc atctgcccga cgaaggtcaa 7560 
atccagatca aacacggtga cgagtgggta gacctggtca ccgcgccagc gcgcaaagtg 7620 
gtggaaatcc gcaaaaccac cgtcggctgg gtgagccagt ttctgcgcgt catcccgcgt 7680 
, atctcagcac tggaagtggt gatgcagccg ctgctcgata ccggcgttcc gcgtgaagcc 7740 
•tgcgccgcta aagcogcgcg tcttctcacc cgcctgaacg tgccggaacg cctgtggcac 7800 
ctggcaccat cgacattttc cggtggcgaa cagcagcgcg ccaacatcgc ccgcggcttt 7860 
atcgtcgact accccattct gctgcttgac gaacctaccg cctcgctgga cgccaaaaac 7920 
agcgccgcgg cggtggaacc gattcgcgaa gccaaaaccc gtggcgcagc catcgtaggc 7980 
20 atcttccatg acgaagctgt acgtaatgac gtcgccgacc gcccgcaccc aatgggagcc 8040 
tcttcatgat tatcaataac gttaagctgg tgctggaaaa cgaggtggta agcggttcgc 8100 
tggaggtgca gaacggcgaa atccgcgcct ttgccgaaag ccagagccgc ctgccggagg 8160 
" cgatggacgg cgaaggcggc tggctgctgc cggggctgat tgagctgcac accgataatc 8220 
tggataaatt cttcaccccg cgcccgaaag ttgactggcc tgcccactcg gcgatgagca 8280 
30 gccacgacgc gctgatggtg gcgagcggca tcaccaccgt actggatgcc gtggcaactg 8340 
gcgacgtgcg cgacggcggc gatcggctgg agaatctgga gaagatgatc aacgccatcg 8400 
aagagacgca gaaacgcggc gtcaaccgcg ccgagcaccg tctgqatctg cgctgcgaac 8460 
" tgccgcatca caccacgctg ccgctgtttg aaaaactggt gcagcgcgag ccggtgacgc 8520 
tggtgtcgct gatggaccac tcgccgggcc agcgccagtt cgccaaccgc gagaagtatc 8580 
40 gcgaatatta tcagggcaaa tactccctca ctgatgcgca gatgcagcag tacgaagaag 8640 
agcaactggc gctcgccgca cgctggtcgc agccgaatcg cgaatccatc gccgccctgt 8700 
gccgcgcgcg aaaaattgcg cttgccagcc acgacgacgc cacccacgcc cacgttgctg 8760 
" aatctcacca gcttggcagc gtgatcgccg aatttcccac cacgttcgaa gcggcggaag 8820 
cctcgcgcaa gcatggcatg aacgtgctga tgggcgcgcc gaatattgtg cgcggcggct 8880 
50 cgcactccgg caacgtggcg gccagtgaac tggcgcagct tggcctgctg gatatcctct 8940 
cttccgacta ctaccccgcc agcctgctcg atgcggcatt tcgcgtcgcc gatgaccaga 9000 
gcaaccgctt cacgctgccg caggcggtga agctggtgac taaaaatcca gcgcaggcgc 9060 
" ttaatctcca ggatcgcggg gtgattggcg agggcaaacg cgccgacctg gtgctggcgc 9120 
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atcgcaagga 


caatcatatt 


catatcgacc 


acgtctggcg 


tcagggtaaa 


agggtgttct 


9180 




gatgatggga 


aaactgattt 


ggttaatggg gccgtccggc 


cccgggaaag acagcctgct 


9240 


5 


ggcggaactc 


cgcctgcggg 


aacaaactca 


gttactggtg 


gcgcatcgct 


acatcacgcg 


9300 




cgatgccagc 


gccggaagtg 


aaaaccatat 


cgccctgagc 


gagcaggagt 


tttttacccg 


9360 


10 


cgcggggcaa 


aatctgttgg 


ccttaagctg 


gcacgctaac 


ggtctgtatt 


atggcgtcgg 


9420 


cgtcgagatt 


gatetctggc 


tgcacgccgg 


attcgacgtg 


ctggtcaacg 


gctcacgcgc 


9480 




ccatctgccg 


caggcgcggg 


cgcgctatca 


atcggcgctg 


ctgcccgtct 


gtttacaggt 


9540 


15 


ttcgccggag 


atcctccgcc 


agcgcctgga 


aaaccgtggc 


cgtgaaaacg 


ccagtgaaat 


9600 




taacgcccgc 


ctggcgcgcg 


ccgcccgcca 


tactccacag 


gattgccata 


cgctcaacaa 


9660 


20 


tgacggcagc 


ctgcgccagt 


cggtcgacac 


gctgctgacg 


ctgatccatc 


agaaggagaa 


9720 


acaccatgcc 


tgcttgtgag 


cttcgcccgg 


ccacgcagta 


cgacaccgac 


gcggtttacg 


9780 




cgctgatttg 


tgagctaaaa 


caggcggagt 


ttgaccacca 


cgcgtttcgc 


gtgggtttta 


9840 


25 


acgccaatct 


gcgcgaccca 


aacatgcgct 


accatctggc 


gctgcttgat 


ggcgaagttg 


9900 




tcggcatgat 


cggcctgcat 


ttgcagtttc 


atctgcatca 


tgtcaactgg 


atcggcgaaa 


9960 


30 


ttcaggagtt 


ggtggtaatg 


ccgcaggcgc 


gcggtctgaa 


cgtcggcagt 


aagttactgg 


10020 


cgtgggcaga 


agaagaagcc 


cgccaggccg 


gggccgaaat 


gaccgaactt 


tcgaccaacg 


10080 




tgaagcgcca 


cgacgcgcac 


cgtttctatc 


tgcgcgaagg 


ctacgagcag 


agccacttcc 


10140 


35 


gcttcaccaa 


ggcgctgtaa 


catgagcctg 


accctcacgc 


tcaccggcac 


cggcggcgca 


10200 




cagggcgttc 


cggcatgggg 


ctgcgagtgt gcggcctgcg 


ccagagcgcg 


gcgctcgccg 


10260 


40 


cagtatcgcc 


gccaaccgtg 


cagcggcgta 


gtgaagttta 


acgacgcaat 


caccctgatc 


10320 


gacgccgggc 


tgcacgatct 


cgccgatcgc 


tggtcgcccg 


gatcgttcca 


gcagtttttg 


10380 




ctgacgcatt 


atcatatgga 


tcacgtccag gggctgtttc 


cgctgcgctg 


gggcgttggc 


10440 


45 


gatccgatcc 


cggtttacgg 


cccgccggat 


gaacagggct 


gcgacgatct 


gtttaaacat 


10500 




ccgggcctgc 


ttgatttcag 


ccacacggtg gaaccgtttg 


tggtgtttga 


tttgcagggg 


10560 


50 


ttacaggtca 


cgcccctgcc 


gctcaaccac 


tcaaaactga 


ccttcggtta 


tctgctggaa 


10620 


acggcacaca 


gccgggtggc 


gtggctgtct 


gacaccgcag 


gtttgccgga aaaaacgctg 


10680 




aaatttttac 


gcaataatca 


gccgcaggta 


atggtgatgg 


attgcagtca 


cccgccgcgc 


10740 


55 


gcggatgcac 


cgcgtaatca 


ctgtgattta 


aataccgtgc 


ttgcgctgaa 


tcaggttatc 


10800 




cgctcgccac 


gggtgattct 


gacccatatc 


agccaccagt 


ttgatgcgtg 


gctgatggaa 


10860 
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aacgcactac cgtcagggtt tgaggtgggg 

tgagagggaa tgtgcgcgct ggccccctca 

5 

accgattgtg ctcgatattg aatattgcgc 

gggcgatgcc tgctccatac ccaacctcat 

10 tcatcctcca gttgccgacg ctcctgatca 

cggcgatctt caaactggcg gcggcggtcg 

tcacgctgcc agccgtcgtc gcgcgaatct 

15 

gcgtcattaa tcgcctgctg aatattgcca 

ggaacgtgga cagcgtgagc agggtgaata 

20 aacctcaaaa agaaactcta tccacattaa 

atctcaattt ctccgctgct ctattgccgt 

gccttcgtac tgtgcgccca gccacgcatc 

25 

cactcgtgaa ccaaaagcca gcacgttggt 
ataaggttcg ctacagacga ccgcgcgaat 

30 

<211> 435 
<212> DNA 

<213> Escherichia coli 

35 <400> 3 

atgcctgctt gtgagcttcg cccggccacg 

atttgtgagc taaaacaggc ggagtttgac 

40 aatctgcgcg acccaaacat gcgctaccat 

atgatcggcc tgcatttgca gtttcatctg 

gagttggtgg taatgccgca ggcgcgcggt 

45 

gcagaagaag aagcccgcca ggccggggcc 
cgccacgacg cgcaccgttt ctatctgcgc 
50 accaaggcgc tgtaa 



<210> 4 
<211> 144 
55 <212> PRT 

<213> Escherichia coli 



tttgatggga tggagattgg ggtggcgtga 10920 
ccctaaccct ctccccagag gggcgagggg 10980 
tcgttttctc cctctcccca ttggggtgag 11040 
cgcccatact catcttccat tctccgctct 11100 
agctggcgct ggcgatcgtc cagctgcctg 11160 
tcatattgtc tgcgccgatc gtcgctcact 11220 
tcatagtctc gcccacggtc agggttataa 11280 
atggtgtcgt cgataatatc ggcctgggcc 11340 
aaagaaatag cggaaagcgt ttcattagcc 114 00 
tcattactca tccatgcaag tagtggatga 11460 
aatcgcctcc acgcgttgtt gatgacgacc 11520 
cacaatcatt tttgccagtt cgaggccaac 11580 
gtcgttatgc tgccgcgaaa gttgcgcgga 11640 
te 11*72 



cagtacgaca ccgacgcggt ttacgcgctg 60 
caccacgcgt ttcgcgtggg ttttaacgcc 120 
ctggcgctgc ttgatggcga agttgtcggc 180 
catcatgtca actggatcgg cgaaattcag 240 
ctgaacgtcg gcagtaagtt actggcgtgg 300 
gaaatgaccg aactttcgac caacgtgaag 360 
gaaggctacg agcagagcca cttccgcttc 420 

435 
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<4C0> 4 

Met Pro Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 
15 10 15 

5 

Val Tyr Ala Leu lie Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 25 30 

Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 
10 t 35 40 45 

Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 
50 55 60 

15 His Leu Gin Phe His Leu His His Val Asn Trp He Gly Glu He Gin 
65 70 75 80 

Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 
85 90 95 

Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 110 



20 



Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 
25 115 120 125 

Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
130 ' 135 140 

30 

<210> 5 

<211> 20 ' 
<212> DNA 
35 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 



40 



<400> 5 

aaacaccatg gctgcttgtg 



45 <210> 6 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
50 <220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 6 

55 gtgacgaatt cgagctcatt acagcgcctt ggtga 3 5 
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10 



35 



<210> 7 
<211> 435 
<212> DNA 

<213> Artificial Sequence 
<220> 



<zzv> 

<223> Description of Artificial Sequence :non-naturally 
occurring nucleotide sequence encoding modified 
PhnO protein P2A; g-c at nucleotide position 4 



48 



96 



144 



. <220> 
<221> CDS 
<222> (1) . . (432) 

15 <400> 7 

atg get get tgt gag ctt cgc ccg gec acg cag tac gac acc gac gcg 

Met Ala Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 

15 10 15 

20 gtt tac gcg ctg att tgt gag eta aaa cag gcg gag ttt gac cac cac 
Val Tyr Ala Leu lie Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 25 30 

gcg ttt cgc gtg ggt ttt aac gec aat ctg cgc gac cca aac atg cgc 
25 Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 
35 40 45 

tac cat ctg gcg ctg ctt gat ggc gaa gtt gtc ggc atg ate ggc ctg 192 
Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 
30 50 . 55 60 

cat ttg cag ttt cat ctg cat cat gtc aac tgg ate ggc gaa att cag 240 
His Leu Gin Phe His Leu His His Val Asn Trp lie Gly Glu He Gin 
65 70 75 80 

gag ttg gtg gta atg ccg cag gcg cgc ggt ctg aac gtc ggc agt aag 
Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 
85 90 95 

40 tta ctg gcg tgg gca gaa gaa gaa gec cgc cag gee ggg gee gaa atg 336 
Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 HO 

acc gaa ctt teg acc aac gtg aag cgc cac gac gcg cac cgt ttc tat 384 
45 Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 
115 120 125 

ctg cgc gaa ggc tac gag cag age cac ttc cgc ttc acc aag gcg ctg 432 
Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
50 130 ' 135 140 



288 



taa 



435 



55 <210> 8 

<211> 144 
<212> PRT 
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<213> Artificial Sequence 
<400> 8 

Met Ala Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 
5 15 10 15 

Val Tyr Ala Leu He Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 25 30 

10 Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 
35 40 45 

Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 
50 55 60 

15 

His Leu Gin Phe His Leu His His Val Asn Trp He Gly Glu He 
65 70 75 80 

Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 
20 8 5 90 95 

Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 HO 



25 



30 



Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 

115 120 125 

Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
130 135 140 



<210> 9 ' . 

<211> 264 t 
<212> DNA 
35 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : transit peptide 
coding sequence 

40 

<220> 

<221> CDS 

<222> (1) . . (264) 

45 <400> 9 

atg get tec tct atg etc tct tec get act atg gtt gec tct ccg get 

Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 
15 10 15 

50 cag gec act atg gtc get cct ttc aac gga ctt aag tec tec get gec 
Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

ttc cca gec ace cgc aag gcz aac aac gac att act tec ate aca age 
55 Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp He Thr Ser He Thr Ser 
35 40 45 



48 



96 



144 
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aac ggc gga aga gtt aac tgc acg cag gtg tgg cot ccg act gga aag 192 

Asn Gly Gly Arg Val Asn Cys Met Gin Val Trp Pro Pro lie Gly Lys 

50 55 60 

5 aag aag ttt a -ag act etc tct tac etc cct gac etc acc gat tec ggt 240 

Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Asp Leu Thr Asp Ser Gly 

65 70 75 30 

ggt cgc gtc aac tgc atg cag gec 264 

10 Gly Arg Val Asn Cys Met Gin Ala 

85 



<210> 10 
15 <211> 88 
<212> PRT 

<213> Artificial Sequence 
<400> 10 

20 Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 
15 10 15 

Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

25 

Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp lie Thr Ser lie Thr Ser 
35 40 45 

Asn Gly Gly Arg Val Asn Cys Met Gin Val Trp Pro Pro lie Gly Lys 
30 50 § 55 60 

Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Asp Leu Thr Asp Ser Gly 
65 70 75 ' 80 

i 

35 Gly Arg Val Asn Cys Met Gin Ala 

85 



<210> 11 
40 <211> 696 
<212> DNA 

<213> Artificial Sequence 
<220> 

45 <223> Description of Artificial Sequence: CTP-AMPA 

acetyltransf erase coding sequence and amino acid 
sequence translation 

<220> 
50 <221> CDS 

<222> (1) . . (696) 

<400> 11 

atg get tec tct atg etc tct tec get act atg gtt gec tct ccg get 48 

55 Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 
15 10 15 
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cag gcc act atg gtc get cct ttc aac gga ctt aag tec tec get gee 
Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 



15 65 



aag aag ttt gag act etc tct tac ctt cct gac ctt ace gat tec ggt 
Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Asp Leu Thr Asp Ser Gly 
" ' 70 75 80 



25 aaa cag gcg gag ttt gac cac cac gcg ttt cgc gtg ggt ttt aac gcc 
Lys Gin Ala Glu Phe Asp His His Ala Phe Arg Val Gly Phe Asn Ala 
115 120 125 



aat ctg cgc gac cca aac atg cgc tac cat ctg gcg ctg ctt gat ggc 
Asn Leu Arg Asp* Pro Asn Met Arg Tyr His Leu Ala Leu Leu Asp Gly 
130 135 140 



cgc cac gac gcg cac cgt ttc tat ctg cgc gaa ggc tac gag cag age 
Arg His Asp Ala His Arg Phe Tyr Leu Arg Glu Gly Tyr Glu Gin Ser 
210 215 220 



cac ttc cgc ttc acc aag gcg ctg 
His Phe Arg Phe Thr Lys Ala Leu 
55 225 230 



96 



ttc cca gcc acc cgc aag get aac aac gac att act tec ate aca age 144 

Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp He Thr Ser He Thr Ser 

35 40 45 

aac ggc gga aga gtt aac tgc atg cag gtg tgg cct ccg att gga aag 192 

Asn Gly Gly Arg Val Asn Cys Met Gin Val Trp Pro Pro He Gly Lys 
50 - 55 60 



240 



288 



ggt cgc gtc aac tgc atg cag gcc atg get get tgt gag ctt cgc ccg 
Gly Arg Val Asn Cys Met Gin Ala Met Ala Ala Cys Glu Leu Arg Pro 
85 90 95 

gcc acg cag tac gac acc gac gcg gtt tac gcg ctg att tgt gag eta 336 
Ala Thr Gin Tyr Asp Thr Asp Ala Val Tyr Ala Leu He Cys Glu Leu 
100 * 105 HO 



384 



432 



480 



gaa gtt gtc ggc atg ate ggc ctg cat ttg cag ttb cat ctg cat cat 

Glu Val Val Gly Met lie Gly Leu His Leu Gin Phe His Leu His His 
35 145 * 150 155 160 

gtc aac tgg ate ggc gaa att cag gag ttg gtg gta atg ccg cag gcg 528 

Val Asn Trp He Gly Glu He Gin Glu Leu Val Val Met Pro Gin Ala 
165 170 175 



cgc ggt ctg aac gtc ggc agt aag tta ctg gcg tgg gca gaa gaa gaa 576 

Arg Gly Leu Asn Val Gly Ser Lys Leu Leu Ala Trp Ala Glu Glu Glu 

180 185 190 

gcc cgc cag gcc ggg gcc gaa atg acc gaa ctt teg acc aac gtg aag 624 

Ala Arg Gin Ala Gly Ala Glu Met Thr Glu Leu Ser Thr Asn Val Lys 
195 200 205 



672 



696 
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10 



15 



30 



45 



<210> 12 
<211> 232 
<212> PRT 

<213> Artificial Sequence 
<400> 12 

Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val A Ser Pro Ala 
! 5 10 15 

Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp lie Thr Ser lie Thr Ser 
35 40 45 

Asn Gly Gly Arg Val Asn Cys Met Gin Val Trp Pro Pro lie Gly Lys 
50 " 55 60 



Lys Lys Phe Glu Thr Leu Ser Tyr Leu Pro Asp Leu Thr Asp Ser Gly 
20 65 70 75 80 

Gly Arg Val Asn Cys Met Gin Ala Met Ala Ala Cys Glu Leu Arg Pro 
85 90 95 

25 Ala Thr Gin Tyr Asp Thr Asp Ala Val Tyr Ala Leu He Cys Glu Leu 
100 105 HO 

Lys Gin Ala Glu Phe Asp His His Ala Phe Arg Val Gly Phe Asn Ala 
115 120 125 



Asn Leu Arg Asp Pro Asn Met Arg Tyr His Leu Ala Leu Leu Asp Gly 
130 135 140 



Glu Val Val Gly Met He Gly Leu His Leu Gin Phe His Leu His His 
35 145 150 155 160 

Val Asn Trp He Gly Glu He Glri Glu Leu Val Val Met Pro Gin Ala 
165 170 175 

40 Arg Gly Leu Asn Val Gly Ser Lys Leu Leu Ala Trp Ala Glu Glu Glu 
180 185 190 

Ala Arg Gin Ala Gly Ala Glu Met Thr Glu Leu Ser Thr Asn Val Lys 
195 200 205 



Arg His Asp Ala His Arg Phe Tyr Leu Arg Glu Gly Tyr Glu Gin Ser 
210 215 220 



His Phe Arg Phe Thr Lys Ala Leu 
50 2 2 5 2 3 0 



<210> 13 
<211> 415 
55 <212> DNA 

<213> Zea mays 
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24 



<220> 

<221> N_region 
<222> (15) . . (163) 

5 <220> 

<221> intron 
<222> (164) . . (322) 

<220> 
10 <221> C_region 

* <222> (323) . . (4-11) 

<400> 13 

tctagaggat cagcatggcg cccaccgtga tgatggcctc gtcggccacc gccgtcgctc 6 0 

15 

cgttcctggg gctcaagtcc accgccagcc tccccgtcgc ccgccgctcc tccagaagcc 120 

tcggcaacgt cagcaacggc ggaaggatcc ggtgcatgca ggtaacaaat gcatcctagc 180 

20 tagtagttct ttgcattgca gcagctgcag ctagcgagtt agtaatagga agggaactga 240 

tgatccatgc atggactgat gtgtgttgcc catcccatcc catcccattt cccaaacgaa 300 

ccgaaaacac cgtactacgt gcaggtgtgg ccctacggca acaagaagtt cgagacgctg 360 

25 

tcgtacctgc cgccgctgtc gaccggcggg cgcatccgct gcatgcaggc catgg 415 



<210> 14 
30 <211> 174 
<212> DNA 

<213> Artificial Sequence 

i 

<220> 

35 <223> Description of Artificial Sequence: chloroplast or 
plastid transit peptide coding sequence and amino 
acid sequence translation 

<220> 
40 <221> CDS 

<222> (1) . . (174) 

<400> 14 

atg get tec tct atg etc tct tec get act atg gtt gee tct ccg get 48 

45 Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 
15 10 15 

cag gee act atg gtc get cct ttc aac gga ctt aag tec tec get gee 96 
Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
50 20 25 30 



55 



ttc cca gee ace cgc aag get aac aac gac att act tec ate aca age 
Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp He Thr Ser He Thr Ser 
35 40 45 

aac ggc gga aga gtt aac tgc atg cag gee 
Asn Gly Gly Arg Val Asn Cys Met Gin Ala 



144 



174 
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50 55 



<210> 15 
5 <211> 58 
<212> PRT 

<213> Artificial Sequence 
<400> 15 

10 Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 
, i 5 10 15 

Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 



15 



35 



40 



45 



Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp He Thr Ser He Thr Ser 
35 40 45 



Asn Gly Gly Arg Val Asn Cys Met Gin Ala 
20 50 55 



<210> 16 
<211> 157 
25 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic 
30 oligonucleotide representing base pairs 1 through 

157 of a 432 base pair AMPA acyltransf erase gene 

<400> 16 

atggccgctt gcgagcttcg cccagccacg cagtacgaca ccgacgccgt gtacgcgctg 60 
atctgcgagc tcaagcaggc ggagttcgac caccacgcct tccgcgtggg cttcaacgcc 12 0 
aacctgcgcg accccaacat gcgctaccat ctggcgc 157 



<210> 17 

<211> 187 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic 

oligonucleotide sequence representing base pairs 
158 through 344 of a 432 base pair AMPA 
50 acyltransf erase gene 

<400> 17 

tgcttgatgg cgaagtggtc ggcatgatcg gcctgcacct ccagttccac ctgcatcatg 60 

55 tcaactggat cggcgagatc caggagctgg tcgtgatgcc acaggcgagg ggtctgaacg 12 0 

tcggcagcaa gctcctggcg tgggccgagg aggaagccag gcaggccgga gccgagatga 18 0 
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ccgagct 187 



5 <210> 18 
<211> 88 
<212> DNA 

<213> Artificial Sequence 
10 <220> 

' <223> Description of Artificial Sequence: synthetic 

oligonucleotide sequence representing base pairs 
345 through 432 of a 432 base pair AMPA 
acyltransf erase gene 



15 



20 



<400> 18 

cagcaccaac gtgaagcgcc acgacgcgca ccgcttctac ctgcgcgaag gctacgagca 6 0 
gagccacttc cgcttcacca aggcgctg 88 



<210> 19 
<211> 432 
<212> DNA 
25 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic 

oligonucleotide providing monocot optimized coding 
30 sequence for an AMPA acetyltransf erase 

<220> * . 

<221> CDS , 
<222> (1) . . (432) 

35 

<400> 19 

atg gcc get tgc gag ctt cgc cca gec acg cag tac gac acc gac gec 48 
Met Ala Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 
15 10 15 

40 

gtg tac gcg ctg ate tgc gag etc aag cag gcg gag ttc gac cac cac 96 
Val- Tyr Ala Leu He Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 25 30 

45 gcc ttc cgc gtg ggc ttc aac gcc aac ctg cgc gac ccc aac atg cgc 144 
Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 
35 40 45 

tac cat ctg gcg ctg ctt gat ggc gaa gtg gtc ggc atg ate ggc ctg 192 
50 Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 
50 55 60 

cac etc cag ttc cac ctg cat cat gtc aac tgg ate ggc gag ate cag 24 0 
His ^eu Gin Phe His Leu His His Val Asn Trp He Gly Glu He Gin 
55 6 5 7 0 7 5 8 0 

gag ctg gtc gtg atg cca cag gcg agg ggt ctg aac gtc ggc age aag 2 88 
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Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 
85 90 95 

etc ctg gcg tgg gec gag gag gaa gec agg cag gec gga gec gag atg 336 
5 Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 HO 

acc gag etc age ace aac gtg aag cgc cac gac gcg cac cgc ttc tac 384 
Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 
10 115 120 125 

ctg cgc gaa ggc tac gag cag age cac ttc cgc ttc acc aag gcg ctg 432 
Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
130 135 140 

15 

<210> 20 
<211> 144 
<212> PRT 
20 <213> Artificial Sequence 



25 



<400> 20 

Met Ala Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 
1 1 5 10 15 

Val Tyr Ala Leu He Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 25 30 



Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 

30 35 t 40 45 

Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 

50 55 '60 

35 His Leu Gin Phe His Leu His His Val Asn Trp He Gly Glu He Gin 

65 70 75 80 

Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 
85 90 95 



40 



Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 HO 



Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 
45 115 120 125 

Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
130 * 135 140 



<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
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oligonucl^.otide PHN1 for use as an amplif icacion 
primer 

<400> 21 
5 atggctgctt gtgagcttcg 



<210> 22 
<211> 20 
10 <212> DNA 

* <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic 
15 oligonucleotide PHN2 for use as an amplification 

primer 

<400> 22 

cagcgccttg gtgaagcgga 

20 

<210> 23 

<211> 1630 

<212> DNA 

25 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: expression 

cassette comprising plant operable promoter linked 
30 to a coding sequence encoding an AMPA 

acetyltransferase linked to a transcription 
termination sequence ' . 

i 

<220> 
35 <221> promoter 

<222> (33) . . (605) 

<220> 

<221> transit_peptide 
40 <222> (627) . . (892) 

<220> 
<221> CDS 

<222> (893) . . (1324) 

45 

<220> 

<221> terminator 
<222> (1350) . . (1605) 

50 <400> 23 

gcggccgcgt tcaagcttga gctcaggatt tagcagcatt ccagattggg ttcaatcaac 60 

aaggtacgag ccatatcact ttattcaaat tggtatcgcc aaaaccaaga aggaactccc 120 

55 atcctcaaag gtttgtaagg aagaattctc agtccaaagc ctcaacaagg tcagggtaca 180 

gagtctccaa accattagcc aaaagctaca ggagatcaat gaagaatctt caatcaaagt 240 
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aaactactgt 


tccagcacat 


gcatcatggt 


cagtaagttt 


cagaaaaaga 


catccaccga 


300 




agacttaaag 


ttagtgggca 


tctttgaaag 


taatcttgtc 


aacatcgagc 


agctggcttg 


360 


5 


tcrrrarraccag 


acaaaaaagg 


aatggtgcag 


aattgttagg 


cgcacctacc 


aaaagcatct 


420 




ttgcctttat 


tgcaaagata 


aagcagattc 


ctctagtaca 


agtggggaac 


aaaataacgt 


480 


10 


ggaaaagagc 


tgtcctgaca 


gcccactcac 


taatgcgtat 


gacgaacgca 


gtgacgacca 


540 




caaaagaatt 


ccctctatat 


aagaaggcat 


tcattcccat 


ttgaaggatc 


atcagatact 


600 


15 


gaaccaatcc 


ttctagaaga 


tctccacaat 


ggcttcctct 


atgctctctt 


ccgctactat 


660 


ggttgcctct 


ccggctcagg 


ccactatggt 


cgctcctttc 


aacggactta 


agtcctccgc. 


720 




tgccttccca 


gccacccgca 


aggctaacaa 


cgacattact 


tccatcacaa 


gcaacggcgg 


780 


20 


aagagttaac 


tgcatgcagg 


tgtggcctcc 


gattggaaag aagaagtttg 


agactctctc 


840 




ttaccttcct 


gaccttaccg 


attccggtgg 


tcgcgtcaac 


tgcatgcagg 


cc atg get 
Met Ala 


898 



25 



946 



994 



1042 



1090 



get tgt gag ctt cgc ccg gec acg cag tac gac acc gac gcg gtt tac 
Ala Cvs Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala Val Tyr 
5 " 10 15 

30 gcg ctg att tgt gag eta aaa cag gcg gag ttt gac cac cac gcg ttt 
Ala Leu lie Cys Glu Leu Lys Gin Ala Glu Phe Asp His His Ala Phe 
20 25 . 30 

cgc gtg ggt ttt aac gec aat ctg cgc gac cca aac atg cgc tac cat 
35 Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg Tyr His 
35 40 45 50 

ctg gcg ctg ctt gat ggc gaa gtt gtc ggc atg ate ggc ctg cat ttg 
Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu His Leu 
40 5 5 6 0 65 

cag ttt cat ctg cat cat gtc aac tgg ate ggc gaa att cag gag ttg 1138 

Gin Phe His Leu His His Val Asn Trp He Gly Glu He Gin Glu Leu 

70 75 80 

45 

gtg gta atg ccg cag gcg cgc ggt ctg aac gtc ggc agt aag tta ctg 1186 

Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys Leu Leu 

85 90 95 

50 gcg tgg gca gaa gaa gaa gec cgc cag gee ggg gec gaa atg acc gaa 1234 
Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met Thr Glu 
100 105 HO 

ctt teg acc aac gtg aag cgc cac gac gcg cac cgt ttc tat ctg cgc 1282 
55 Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr Leu Arg 
115 120 125 130 
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gaa ggc tac gag cag age cac ttc cgc ttc acc aag gcg ctg 1324 
Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
135 140 

taatgagctc ggtaceggat ccaattcccg atcgttcaaa catttggcaa taaagtttct 1384 

taagattgaa tcctgttgcc ggtcttgega tgattatcat ataatttctg ttgaatcacg 1444 

ttaagcatgt aataattaac atgtaatgca tgacgttatt tatgagatgg gtttttatga 1504 

1 ttagagtccc gcaattatac atttaatacg cgatagaaaa caaaatatag cgcgcaaact 1564 

aggataaatt atcgcgcgcg gtgtcatcta tgttactaga teggggateg atccccgggc 1624 

15 ggcege 1630 



<210> 24 

<211> 144 

20 <212> PRT 

<213> Artificial Sequence 

<400> 24 

Met Ala Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 
25 1 5 10 15 

Val Tyr Ala Leu lie Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 2 5 3 0 

30 Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 
35 40 45 

Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 
50 55 60 



35 



His Leu Gin Phe His Leu His His Val Asn Trp He Gly Glu He Gin 
65 70 75 80 



Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 
40 85 90 95 

Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 110 

45 Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 
115 120 125 

Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
130 135 140 

50 

<210> 25 
<211> 2122 
<212> DNA 
55 <213> Artificial Sequence 

<220> 



WO 00/29596 



PCT/US99/27152 



-31 - 



<223> Description of Artificial Sequence : expression 
cassette comprising plant promoter linked to 
sequence encoding AMPA acetyl transferase linked 
to ^termination sequence 



<220> 

<221> promoter 
<222> (6) . . (620) 

10 <220> 

• <221> 5'UTR 

<222> (645) . . (715) 

<220> 
15 <221> intron 

<222> (729) . . (1178) 

<220> 

<221> transit_peptide 
20 <222> (1179) . . (1406) 

<220> 
<221> CDS 

<222> (1407) . . (1838) 

25 



<220> 

<221> terminator 
<222> (1849) . . (2082) 

30 cJgcaggtcc gatgtgagac ttttcaacaa agggtaatat ccggaaacct cctcggattc 60 
cattgcccag ctatctgtca ctttattgtg aagatagtgg aaaaggaagg tggctcctac 120 

35 aaatgccatc attgcgataa aggaaaggcc atcgttgaag atgcctctgc cgacagtggt 180 
cccaaagatg gacccccacc cacgaggagc atcgtggaaa aagaagacgt tccaaccacg 24 0 
tcttcaaagc aagtggattg atgtgatggt ccgatgtgag acttttcaac aaagggtaat 300 

40 atccggaaac ctcctcggat tccattgccc agctatctgt cactttattg tgaagatagt 360 
ggaaaaggaa ggtggctcct acaaatgcca tcattgcgat aaaggaaagg ccatcgttga 420 

45 agatgcctct gccgacagtg gtcccaaaga tggaccccca cccacgagga gcatcgtgga 480 
aaaagaagac gttccaacca cgtcttcaaa gcaagtggat tgatgtgata tctccactga 540 
cgtaagggat gacgcacaat cccactatcc ttcgcaagac ccttcctcta tataaggaag 600 

50 ttcatttcat ttggagagga cacgctgaca agctgactct agcagatcct ctagaaccat 660. 
cttccacaca ctcaagccac actattggag aacacacagg gacaacacac cataagatcc 720 

55 aagggaggcc tccgccgccg ccggtaacca ccccgcccct ctcctctttc tttctccgtt 780 
tttttttccg tctcggtctc gatctttggc cttggtagtt tgggtgggcg agaggcggct 840 
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tcgtgcgcgc 
gatgtagatc 

5 

tgctaaacaa 
tgctgcttcg 
10 taatccctca 
ctcgtgcgga 
gcaatggtgt 

15 

ctcccttatc 
cgtggggatt 
20 tcatgtcttc 



ccaga^cggt 
tgcgatccgc 
gatcaggaag 
tcaggcttag 
gcattgttca 
gcttttttgt 
gcagaaccca 
ggtttctctg 
gaagaagagt 
tgtttccacg 



gcgcgggagg ggcgggatct cgcggggaat ggggctctcg 900 

cgttgttggg ggagatgatg gggcgtttaa aatttcgccg 960 

aggggaaaag ggcactatgg tttatatttt tatatatttc 1020 

atgtgctaga tctttctttc ttctttttgt gggtagaatt 1080 

tcggtagttt ttcttttcat gatttcgtga caaatgcagc 1140 

aggtagaagt gatcaaccat ggcgcaagtt agcagaatct 1200 

tctcttatct ccaatctctc gaaatccagt caacgcaaat 1260 

aagacgcagc agcatccacg agcttatccg atttcgtcgt 132 0 

gggatgacgt taattggctc tgagcttcgt cctcttaagg 1380 

gcgtgc atg gcc get tgc gag ctt cgc cca gec 1433 
Met Ala Ala Cys Glu Leu Arg Pro Ala 



acg cag tac gac acc gac gcc gtg tac gcg ctg ate tgc gag etc aag 
25 Thr Gin Tyr Asp Thr Asp Ala Val Tyr Ala Leu He Cys Glu Leu Lys 
10 15 20 25 

cag gcg gag ttc gac cac cac gcc ttc cgc gtg ggc ttc aac gcc aac 
Gin Ala Glu Phe Asp His His Ala Phe Arg Val Gly Phe Asn Ala Asn 
30 ,30 35 40 

ctg cgc gac ccc aac atg cgc tac cat ctg gcg ctg ctt gat ggc gaa 

Leu Arg Asp Pro Asn Met Arg Tyr His Leu Ala Leu Leu Asp Gly Glu 

45 50 1 55 

35 

gtg gtc ggc atg ate ggc ctg cac etc cag ttc cac ctg cat cat gtc 

Val Val Gly Met He Gly Leu His Leu Gin Phe His Leu His His Val 

60 65 70 

40 aac tgg ate ggc gag ate cag gag ctg gtc gtg atg cca cag gcg agg 
Asn Trp He Gly Glu He Gin Glu Leu Val Val Met Pro Gin Ala Arg 
75 80 85 

ggt ctg aac gtc ggc age aag etc ctg gcg tgg gcc gag gag gaa gcc 
45 Gly Leu Asn Val Gly Ser Lys Leu Leu Ala Trp Ala Glu Glu Glu Ala 
90 95 100 105 

agg cag gcc gga gcc gag atg acc gag etc age acc aac gtg aag cgc 
Arg Gin Ala Gly Ala Glu Met Thr Glu Leu Ser Thr Asn Val Lys Arg 
50 110 115 120 

cac gac gcg cac cgc ttc tac ctg cgc gaa ggc tac gag cag age cac 
His Asp Ala His Arg Phe Tyr Leu Arg Glu Gly Tyr Glu Gin Ser His 
125 130 135 



55 



ttc cgc ttc acc aag gcg ctg taaagatctg aattctgeat gcgtttggac 
Phe Arg Phe Thr Lys Ala Leu 



1481 



1529 



1577 



1625 



1673 



1721 



1769 



1817 



1868 
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140 

gtatgctcat tcaggttgga gccaatttgg ttgatgtgtg tgcgagttct tgcgagtctg 1928 
5 atgagacatc tctrtattgt gtttctttcc ccagtgtttt ctgtacttgt gtaatcggct 1988 
aatcgccaac agattcggcg atgaataaat gagaaataaa ttgttctgat tttgagtgca 2048 
aaaaaaaagg aattagatct gtgtgtgttt tttggatccc cggggcggcc gccccgggtg* 2108 

10 

gtgagcttct gcag 

<210> 26 
15 <211> 144 
<212> PRT 

<213> Artificial Sequence 
<400> 26 

20 Met Ala Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 
15 10 15 

Val Tyr Ala Leu He Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 25 30 



25 



Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 
35 40 45 



Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 
30 " 50 55 60 

His Leu Gin Phe His Leu His His Val Asn Trp He Gly Glu He Gin 
65 70 75 ' . 80 

35 Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 

85 90 95 

Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 HO 



40 



Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 
115 120 125 



Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
45 130 " 135 140 



<210> 27 
<211> 2378 
50 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : expression 
55 cassette comprising a plant promoter linked to an 

intron, a sequence encoding an AMPA acetyl 
transferase, and a termination sequence 
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<220> 

<22I> promoter 
<222> (28) . . (965) 

5 

<220> 

<221> intron 

<222> (966) . . (1423) 

10 <220> 

• <221> transit_peptide 
<222> (1440) . . (1667) 

<220> 
15 <221> CDS 

<222> (1668) . . (2099) 

<220> 

<221> terminator 
20 <222> (2114) . . (2369) 



<400> 27 





gatatcccta 


gggcggccgc 


gttaacaagc 


25 


agagtcggga 


tagtccaaaa 


taaaacaaag 




cagttaaaag 


gtggtataaa 


gtaaaatatc 


30 


actcttttct 


actattataa 


aaattgagga 


ttgtatgaat 


tggtttttaa 


gtttattcgc 




ggttttaagt 


tcgtttgctt 


ttgtaaatac 


35 


aaaacccata 


tgctaatttg 


acataatttt 




acaatgaaca 


ataataagat 


taaaatagct 


40 


ctagtaaaaa 


taaaagataa 


acttagactc 


cctaaagccc 


aaagtgctat 


ccacgatcca 




agcccacccc 


agtccagcca 


actggacaat 


45 


gttgtccgca 


cgcaccgcac 


gtctcgcagc 




agaaaaaaca 


gcaggtgggt 


ccgggtcgtg 


50 


cagcgacgag 


gccggccctc 


cctccgcttc 


catacccccc 


cctctcctcc 


catcccccca 




cctcctcccc 


cctcgctgcc 


ggacgacgag 


55 


cgccggtaac 


caccccgccc 


ctctcctctt 




tcgatctttg 


gccttggtag 


tttgggtggg 



ttactcgagg tcattcatat gcttgagaag 6 0 
gtaagattac ctggtcaaaa gtgaaaacat 120 
ggtaataaaa ggtggcccaa agtgaaattt 180 
tgtttttgtc ggtactttga tacgtcattt 240 
ttttggaaat gcatatctgt atttgagtcg 300 
agagggattt gtataagaaa tatctttaga 360 
tgagaaaaat atatattcag gcgaattctc 420 
ttcccccgtt gcagcgcatg ggtatttttt 480 
aaaacattta caaaaacaac ccctaaagtt 540 
tagcaagccc agcccaaccc aacccaaccc 600 
agtctccaca cccccccact atcaccgtga 660 
caaaaaaaaa aagaaagaaa aaaaagaaaa 720 
ggggccggaa acgcgaggag gatcgcgagc 780 
caaagaaacg ccccccatcg ccactatata 840 
accctaccac caccaccacc accacctcca 900 
ctcctccccc ctccccctcc gccgccgccg 960 
tctttctccg tttttttttc cgtctcggtc 1020 
cgagaggcgg cttcgtgccg cccagatcgg 1080 
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15 



25 



35 



40 



55 



tgcgcgggag gggcgggatc 


tcgcggctgg 


ctctcgcccc cgtggatccg 


gcccggatct 


1140 


cgcggggaat ,ggggctctcg 


gatgtagatc 


tgcgatccgc cgttgttggg 


gecgatgatg 


1200 


gggcccttaa 


aatttccgcc 


gtgctaaaca 


agatcaggaa gaggggaaaa 


gggcactatg 


1260 


gtttatattt 


ttatatattt 


ctgctgcttc gtcaggctta gatgtgctag 


atctttcttt 


1320 


cttctttttg 


tgggtagaat 


ttaatccctc agcattgttc atcggtagtt 


tttcttttca 


1380 


tgattcgtga 


caaatgcagc 


ctcgtgcgga 


cgtttttttg taggtagaag 


tgatcaacca 


1440 


tggcgcaagt 


tagcagaatc 


tgcaatggtg 


tgcagaaccc atctcttatc 


tccaatctct 


1500 


cgaaatccag 


tcaacgcaaa 


tctcccttat 


cggtttctct gaagacgcag 


cagcatccac 


1560 


gagcttatcc 


gatttcgtcg 


tcgtggggat 


tgaagaagag tgggatgacg 


ttaattggct 


1620 


ctgagcttcg 


tcctcttaag 


gtcatgtctt 


ctgtttccac ggcgtgc atg gcc get 
Met Ala Ala 


1676 



tgc gag ctt cgc cca gcc acg cag tac gac acc gac gcc gtg tac gcg 
Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala Val Tyr Ala 
5 " 10 15 



ctg ate tgc gag etc aag cag gcg gag ttc gac cac cac gcc ttc cgc 
Leu He Cys Glu Leu Lys Gin Ala Glu Phe Asp His His Ala Phe Arg 
30 20 , 25 30 35 

gtg ggc ttc aac gcc aac ctg cgc gac ccc aac atg cgc tac cat ctg 
Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg Tyr His Leu 
40 45 • SO 



gtg atg cca cag gcg agg ggt ctg aac gtc ggc age aag etc ctg gcg 
45 Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys Leu Leu Ala 
85 90 95 

tgg gcc gag gag gaa gcc agg cag gcc gga gcc gag atg acc gag etc 
Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met Thr Glu Leu 
50 1 00 1 05 HO 115 

age acc aac gtg aag cgc cac gac gcg cac cgc ttc tac ctg cgc gaa 
Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr Leu Arg Glu 
120 125 130 



1724 



1772 



1820 



1868 



gcg ctg ctt gat ggc gaa gtg gtc ggc atg ate ggc ctg cac etc cag 
Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu His Leu Gin 
55 60 65 

ttc cac ctg cat cat gtc aac tgg ate ggc gag ate cag gag ctg gtc 1916 
Phe His Leu His His Val Asn Trp He Gly Glu lie Gin Glu Leu Val 
70 75 80 



1964 



2012 



2060 



ggc tac gag cag age cac ttc cgc ttc acc aag gcg ctg taaagatctg 2109 
Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
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135 140 

aattcccgat cgttcaaaca tttggcaata aagtttctta agattgaacc ctgttgccgg 2169 

tcttgcgatg attatcatat aatttctgtt gaattacgtt aagcatgtaa taattaacat 2229 

gtaatgcatg acgtt&t/eta tgagatgggt ttttatgatt agagtcccgc aattatacat 2289 

ttaatacgcg atagaaaaca aaatatagcg cgcaaactag gataaattat cgcgcgoggt 2349 ; 
gtcatctatg ttactagatc ggggatatc 2378 



<210> 28 
15 <211> 144 
<212> PRT 

<213> Artificial Sequence 



20 



25 



<400> 28 

Met Ala Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 
15 10 15 

Val Tyr Ala Leu He Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 25 30 

Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 
35 40 45 



Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 
30 * 50 , 55 60 

His Leu Gin Phe His Leu His His Val Asn Trp He Gly Glu He Gin 
65 70 75 * 80 



35 



40 



Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 

85 90 95 

Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 HO 

Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 
115 120 125 



Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
45 130 * * 135 140 



<210> 29 
<211> 2107 
50 <212> DNA 

<213> Artificial 



<220> 

<223> Description of Artif'^ial Sequence : expression 
55 cassette comprising plant operable promoter linked 

to a leader, intron, a sequence encoding an AM PA 
acetyltransf erase, and termination sequence 
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<220> 

<221> promoter 
<222> (26) . . (590) 

5 

<220> 

<221> 5'UTR 

<222> (615) . . (685) 

10 <220> 

<221> intron 
' <222> (699) . . (1-148) 

<220> 

15 <221> transit_peptide 
<222> (1149) . . (1426) 

<220> 
<221> CDS 
20 <222> (1427) . . (1858) 

<220> 

<221> terminator 
<222> (1869) . . (2102) 

25 

<400> 29 





gcggccgcgt 


taacaagctt 


ctgcaggtcc gatgtgagac 


ttttcaacaa 


agggtaatat 


60 


30 


ccggaaacct 


cctcggattc 


cattgcccag ctatctgtca 


ctttattgtg 


aagatagtgg 


120 


aaaaggaagg 


tggctcctac 


aaatgccatc attgcgataa 


aggaaaggcc 


atcgttgaag 


180 




atgcctctgc 


cgacagtggt 


cccaaagatg gacccccacC 


cacgaggagc 


atcgtggaaa 


240 


35 


aagaagacgt 


tccaaccacg 


tcttcaaagc aagtggattg 


atgtgatggt 


ccgatgtgag 


300 




acttttcaac 


aaagggtaat 


atccggaaac ctcctcggat 


tccattgccc 


agctatctgt 


360 


40 


cactttattg 


tgaagatagt 


ggaaaaggaa ggtggctcct 


acaaatgcca 


tcattgcgat 


420 


aaaggaaagg 


ccatcgttga 


agatgcctct gccgacagtg 


gtcccaaaga 


tggaccccca 


480 




cccacgagga gcatcgtgga 


aaaagaagac gttccaacca 


cgtcttcaaa gcaagtggat 


540 


45 


tgatgtgata 


tctccactga 


cgtaagggat gacgcacaat 


cccactatcc 


ttcgcaagac 


600 




ccttcctcta 


tataaggaag 


ttcatttcat ttggagagga 


cacgctgaca agctgactct 


660 


50 


agcagatcct 


ctagaaccat 


cttccacaca ctcaagccac 


actattggag aacacacagg 


720 


gacaacacac 


cataagatcc 


aagggaggcc tccgccgccg 


ccggtaacca 


ccccgcccct 


7B0 




ctcctctttc 


tttctccgtt 


tttttttccg tctcggtctc 


gatctttggc 


cttggtagtt 


840 


55 


tgggtgggcg 


agaggcggct 


tcgtgcgcgc cc agatcggt 


gcgcgggagg 


ggc;ggatct 


900 




cgcggggaat 


ggggctctcg 


gatgtagatc tgcgatccgc 


cgttgttggg ggagatgatg 


960 
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gggcgtttaa 
tttatatttt 

5 

ttctttttgt 
gatttcgtga 
10 ggcgcaagtt 
gaaatccagt 
agcttatccg 
tgagcttcgt 



15 



aatttcgccg 
tatatatttc 
gggtagaatt 
caaatgcagc 
agcagaatct 
caacgcaaat 
atttcgtcgt 
cctcttaagg 



tgctaaacaa 
tgctgcttcg 
taatccctca 
ctcgtgcgga 
gcaatggtgt 
ctcccttatc 
cgtggggatt 
tcatgtcttc 



gatcaggaag 
tcaggcttag 
gcattgttca 
gcttttttgt 
gcagaaccca 
ggtttctctg 
gaagaagagt 
tgtttccacg 



aggggaaaag 

atgtgctaga 

tcggtagttt 

aggtagaagt 

tctcttatct 

aagacgcagc 

gggatgacgt 

gcgtgc atg 
Met 



ggcactatgg 1020 
tctttctttc 1080 
ttcttttcat 1140 
gatcaaccat 1200 
ccaatctctc 1260 
agcatccacg 1320 
taattggctc 1380 



gcc get 
Ala Ala 



1435 



20 tgc gag ctt cgc cca gcc acg cag tac gac acc gac gcc gtg tac gcg 

Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala Val Tyr Ala 

5 10 15 

ctg ate tgc gag etc aag cag gcg gag ttc gac cac cac gcc ttc cgc 

25 Leu He Cys Glu Leu Lys Gin Ala Glu Phe Asp His His Ala Phe Arg 

20 * 25 30 35 



30 



35 



gtg ggc ttc aac gcc aac ctg cgc gac ccc aac atg cgc tac cat ctg 
Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg Tyr His Leu 
.40 45 50 

gcg ctg ctt gat ggc gaa gtg gtc ggc atg ate g^c ctg cac etc cag 
Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu His Leu Gin 
55 60 65 

ttc cac ctg cat cat gtc aac tgg ate ggc gag ate cag gag ctg gtc 
Phe His Leu His His Val Asn Trp He Gly Glu He Gin Glu Leu Val 
70 75 80 



40 gtg atg cca cag gcg agg ggt ctg aac gtc ggc age aag etc ctg gcg 
Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys Leu Leu Ala 
85 90 95 

tgg gcc gag gag gaa gcc agg cag gcc gga gcc gag atg acc gag etc 
45 Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met Thr Glu Leu 
100 105 110 115 



age acc aac gtg aag cgc cac gac gcg cac cgc ttc tac ctg cgc gaa 
Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr Leu Arg Glu 
120 125 130 



50 



55 



1483 



1531 



1579 



1627 



1675 



1723 



1771 



1819 



1868 



ggc tac gag cag age cac ttc cgc ttc acc aag gcg ctg taaagatctg 
Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
135 140 

aattctgeat gcgtttggac gtatgetcat tcaggttgga gecaatttgg ttgatgtgtg 1928 
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tgcgagttct tgcgagtctg atgagacatc tctgtattgt gtttctttcc ccagtgtttt 198B 
ctgtacttgt gtaatcggct aatcgccaac agattcggcg atgaataaat gagaaataaa 2048 
5 ttgttctgat N tttgagtgca aaaaaaaagg aattagatct gtgtgtgttt tttggatcc 2107 



<210> 30 
<211> 144 
10 <212> PRT 

<213> Artificial Sequence 

<400> 30 

Met Ala Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 
15 1 5 10 15 

Val Tyr Ala Leu lie Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 25 30 

20 Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 
35 40 45 

Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 
50 55 60 



25 



35 



40 



His Leu Gin Phe His Leu His His Val Asn Trp He Gly Glu He Gin 
65 70 75 80 



Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 

30 85 90 95 

Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 ' • HO 



Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 

115 120 125 

Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
130 135 140 



<210> 31 

<211> 2436 

<212> DNA 

45 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence :monocot 

expression cassette comprising plant operable 
50 promoter linked to an intron, a sequence coding 

for an AMPA acetyltransf erase, and a termination 
sequence 

<220> 
55 <221> promoter 

<222> (26) . . (640) 
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25 



<220> 

<221> intron 

<222> (670) . . (1473) 

5 <220> 

<221> transit_peptide 
<222> (1498) . . (1725) 

<220> 
10 <221> CDS 

1 <222> (1726) . .J2157) 

<220> 

<221> terminator 
15 <222> (2172) . . (2427) 

<400> 31 

gcggccgcgt taacaagctt ctgcaggtcc gatgtgagac ttttcaacaa agggtaatat 60 
20 ccggaaacct cctcggattc cattgcccag ctatctgtca ctttattgtg aagatagtgg 120 
aaaaggaagg tggctcctac aaatgccatc attgcgataa aggaaaggcc atcgttgaag 180 
atgcctctgc cgacagtggt cccaaagatg gacccccacc cacgaggagc atcgtggaaa 240 
aagaagacgt tccaaccacg tcttcaaagc aagtggattg atgtgatggt ccgatgtgag 300 
acttttcaac aaagggtaat atccggaaac ctcctcggat tccattgccc agctatctgt 360 
30 cactttattg tgaagatagt ggaaaaggaa ggtggctcct acaaatgcca tcattgcgat 420 
aaaggaaagg ccatcgttga agatgcctct gccgacagtsg gtcccaaaga tggaccccca 480 
cccacgagga gcatcgtgga aaaagaagac gttccaacca cgtcttcaaa gcaagtggat 54 0 
35 tgatgtgata tctccactga cgtaagggat gacgcacaat cccactatcc ttcgcaagac 600 
ccttcctcta tataaggaag ttcatttcat ttggagagga cacgctgaca agctgactct 660 
40 agcagatcta ccgtcttcgg tacgcgctca ctccgccctc tgcctttgtt actgccacgt 720 
ttctctgaat gctctcttgt gtggtgattg ctgagagtgg tttagctgga tctagaatta 780 
cactctgaaa tcgtgttctg cctgtgctga ttacttgccg tcctttgtag cagcaaaata 840 
tagggacatg gtagtacgaa acgaagatag aacctacaca gcaatacgag aaatgtgtaa 900 
tttggtgctt agcggtattt atttaagcac atgttggtgt tatagggcac ttggattcag 960 
50 aagtttgctg ttaatttagg cacaggcttc atactacatg ggtcaatagt atagggattc 1020 
atattatagg cgatactata ataatttgtt cgtctgcaga gcttattatt tgccaaaatt 108 0 
agatattcct attctgtttt tgtt jtgtg ctgttaaatt gttaacgcct gaaggaataa 1140 
atataaatga cgaaattttg atgtttatct ctgctccttt attgtgacca taagtcaaga 1200 



45 



55 
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tcagata-ac ttgttttaaa tattgttgtc tgaagaaata agtactgaca gtactttgac 1260 

gcattgatct gcttgtttgt tgtaacaaaa tttaaaaata aagagtttcc tttttgctgc 1320 

tctccttacc tcctgatggt atctagtatc taccaactga cactatattg cttctcttta 1380 

catacgtatc ttgctcgatg ccttctccct agtgttgacc agtgttactc acatagtctt 1440 

tgctcatttc attgtaatgc agataccaag cggcctctag aggatccagg agcaaccatg 1500 f 

gcgcaagtta gcagaatctg caatggtgtg cagaacccat ctcttatctc caatctctcg 1560 

aaatccagtc aacgcaaatc tcccttatcg gtttctctga agacgcagca gcatccacga 1620 

gcttatccga tttcgtcgtc gtggggattg aagaagagtg ggatgacgtt aattggctct 1680 

qagcttcgtc ctcttaaggt catgtcttct gtttccacgg cgtgc atg gcc get tgc 1737 
33 ~ " " Met Ala Ala Cys 

gag ctt cgc cca gcc acg cag tac gac acc gac gcc gtg tac gcg ctg 1785 
Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala Val Tyr Ala Leu 
5 10 15 20 

ate tgc gag etc aag cag gcg gag ttc gac cac cac gcc ttc cgc gtg 
He Cys Glu Leu Lys Gin Ala Glu Phe Asp His His Ala Phe Arg Val 
25 30 35 

ggc ttc aac gcc aac ctg cgc gac ccc aac atg cgc tac cat ctg gcg 
Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg Tyr His Leu Ala 
40 45 50 

ctg ctt gat ggc gaa gtg gtc ggc atg ate ggc ctg cac etc cag ttc 
Leu Leu Asp Gly Glu Val Val Gly Met lie Gly Leu His Leu Gin Phe 
55 60 65 

cac ctg cat cat gtc aac tgg ate ggc gag ate cag gag ctg gtc gtg 
His Leu His His Val Asn Trp He Gly Glu He Gin Glu Leu Val Val 
70 75 80 

atg cca cag gcg agg ggt ctg aac gtc ggc age aag etc ctg gcg tgg 
Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys Leu Leu Ala Trp 
85 90 95 100 

gcc gag gag gaa gcc agg cag gcc gga gcc gag atg acc gag etc age 
Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met Thr Glu Leu Ser 
105 HO 115 

acc aac gtg aag cgc cac gac gcg cac cgc ttc tac ctg cgc gaa ggc 
Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr Leu Arg Glu Gly 
120 125 130 

tac gag cag age cac ttc cgc ttc acc aag gcg ctg taaagatctg 
Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
135 140 

aattcccgat cgttcaaaca tttggcaata aagtttctta agattgaatc ctgttgccgg 2227 



1833 



1881 



1929 



1977 



2025 



2073 



2121 



2167 
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tcttgcgatg attatcatat aatttctgtt gaattacgtt aagcatgtaa taattaacat 2287 
gtaatgcatg acgttattta tgagatgggt ttttatgatt agagtcccgc aattatacat 2347 

5 

ttaatacgcg atagaaaaca aaatatagcg cgcaaactag gataaattat: cgcgcgcggt 24 07 
gtcatctatg ttactagatc ggggatatc „ 2436 

10 

' <210> 32 
<211> 144 
<212> PRT 

<213> Artificial Sequence 

15 

<400> 32 

Met Ala Ala Cys Glu Leu Arg Pro Ala Thr Gin Tyr Asp Thr Asp Ala 
1 5 10 15 

20 Val Tyr Ala Leu He Cys Glu Leu Lys Gin Ala Glu Phe Asp His His 
20 25 30 

Ala Phe Arg Val Gly Phe Asn Ala Asn Leu Arg Asp Pro Asn Met Arg 
35 40 45 

25 

Tyr His Leu Ala Leu Leu Asp Gly Glu Val Val Gly Met He Gly Leu 
50 55 60 

His Leu Gin Phe His Leu His His Val Asn Trp He Gly Glu He Gin 
30 65 ' 70 75 80 

Glu Leu Val Val Met Pro Gin Ala Arg Gly Leu Asn Val Gly Ser Lys 
85 90 , 95 

35 Leu Leu Ala Trp Ala Glu Glu Glu Ala Arg Gin Ala Gly Ala Glu Met 
100 105 HO 

Thr Glu Leu Ser Thr Asn Val Lys Arg His Asp Ala His Arg Phe Tyr 
115 120 125 

40 

Leu Arg Glu Gly Tyr Glu Gin Ser His Phe Arg Phe Thr Lys Ala Leu 
130 135 140 
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